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WEAK COMPACTNESS AND VECTOR MEASURES 


R. G. BARTLE, N. DUNFORD anv J. SCHWARTZ 


Introduction. It is the purpose of this paper to develop a Lebesgue theory 
of integration of scalar functions with respect to a countably additive measure 
whose values lie in a Banach space. The class of integrable functions reduces 
to the ordinary space of Lebesgue integrable functions if the measure is scalar 
valued. Convergence theorems of the Vitali and Lebesgue type are valid in 
the general situation. The desirability of such a theory is indicated by recent 
developments in spectral theory. 

In §1 two criteria for the conditional weak compactness of subsets of the 
Banach space of countably additive measures on a ¢-field = are derived. Their 
force is sufficient to allow us to conclude, in §2, that if u is a countably additive 
measure on = with values in a Banach space, then there exists a positive 
scalar measure vy on = with respect to which yu is »-continuous (i.e., v-absolutely 
continuous). This permits the development of the integration theory. 

As an example of an elementary application of the integration theory we 
give, in §3, the ‘‘vector” generalization of the celebrated Riesz theorem on 
the representation of linear functionals on a space of continuous functions. 
This yields representation theorems for the general, the weakly compact, and 
the compact operators on a space of continuous functions. Some of these 
results are related to theorems of Gelfand (6) and Grothendieck (7). 

Our notation and terminology are standard. We mention specifically that 
the weak topology of a Banach space % is the topology induced by its adjoint 
space ¥* in the familiar fashion, even though % itself may be the adjoint of 
some other space. By the ¥ topology of ¥* we mean the topology on ¥* which 
has as a typical neighbourhood of the origin, the set {x* € ¥*| |x*(x,)| < 1, 
p= l,...,2}. 


1. Weakly compact sets of measures. Let S be an abstract set and = bea 
o-field (i.e., c-algebra) of subsets of S; sets in = will frequently be called the 
measurable subsets of S. 


1.1. Derrmition. By ca(Z) we denote the set of all countably additive 
real or complex valued measures defined on = having finite variation. A 
generic element of ca(Z) will ordinarily be denoted by the letter \; positive 
measures by v. The symbol |A|(Z) denotes the total variation of \ over the set 
E € 2%. The symbol |A| represents the total variation of \ over the entire space 
S. 
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With |A| as the norm, ca(Z) forms a real or complex Banach space. It is the 
purpose of this section to characterize the subsets of ca(Z) which are condi- 
tionally weakly compact. We shall make free use of the theorem of Eberlein 
(5) asserting that such sets K are precisely those which are weakly sequentially 
compact in the sense that an arbitrary sequence from K has a subsequence 
which converges weakly to an element of ca(Z). We observe that if a sequence 
{A} converges weakly, then it is bounded and {d,(E)} is a convergent set 
of scalars for each E € 2. 

Finally, we mention that we will make essential use of the Vitali-Hahn-Saks 
theorem (15, p. 967): Let » be a finite positive measure on 2, let {A,} C ca(Z) 
be such that lim \,(2£) exists for each E € 2, and let 

lim A,(Z) =0 
»(B)+0 
for each n. It follows that this last limit exists uniformly in n, and if ) is the set 


function defined by 
A(E) = lim A, (Z£) 


for E € &, then A € ca(2). 


1.2. Lemma. Jf {dx} is a sequence in ca(Z), there is a positive measure 
vy € ca(Z) such that 


lim A,(Z) = 0, i=1,2,. 
*(Z)~0 
Proof. The measure defined by 
=o [hl (E) 
E) = ¢ Ld (EZ) z, 
ME) eT i ile 


has the property that if y(Z) = 0 then |A,/(Z) = 0. The conclusion then follows 
(8, p. 125). 


1.3. THEOREM.' For a set K C ca(Z) to be conditionally weakly compact 
it is mecessary and sufficient that 

(1) the set K is bounded, and 

(2) if {E,} is @ sequence in = which decreases to the void set, then 


lim A(E,;) = 0, 
too 
uniformly for \ € K. 


Proof. The necessity of (1) is a consequence of the uniform boundedness 
theorem. To show that (2) is necessary we proceed indirectly, and suppose 
that there exists a positive number ¢«, a sequence {E,} C = with E, | ¢, 
and a sequence {A,} C K such that 


lAs(E,)| > «, i=1,2,. 


1The reader should observe the similarity between this theorem and some of the criteria, 
established by Grothendieck (7, p. 146), for weak compactness in the space of bounded regular 
Radon) measures. 





= — 
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Since K is assumed to be weakly compact, we may suppose that {A,} is weakly 
convergent so that lim A,(£) exists for all E € 2. Construct » € ca(Z) as in 
the lemma; by the Vitali-Hahn-Saks theorem 

im A,(ZE) = 0, uniformly for i = 1, 2,. 

»(B)' 
Since E; | ¢, it follows that »(Z,) — 0 and hence \,(E,) — 0, which contra- 
dicts the supposition and establishes the necessity of (2). 

To prove the sufficiency, let {,,} be an arbitrary sequence in K. Construct 
a positive measure v € ca(Z) as in the lemma. Let ca(Z;v) be the closed 
linear manifold in ca(Z) consisting of measures \ which are »-continuous, i.e. 
such that ° 
lim A(E£) = 0. 


»(B)~0 
By the Radon-Nikodmy theorem (8, p. 128) there is an isometric isomorphism 
between the space ca(Z, v) and the space L(S, 2, v) of v-integrable functions, 
the correspondence \ «+ f being given by the formula 


X(E) = Spo v(ds), E€ 2%. 


Denote the functions corresponding to the {A,,} by {f,,}. Let {G,|" = 1,2,...} 
be a basis for the open sets in the scalar field and let E,, = fn~' (Ga). 

Let 2, be the o-field generated by the sets {£,,,}. Observe that each f, 
is measurable with respect to 2, and thus is in the space L(S, 2,, v). Let 2, 
be the field generated by the sets {£,,,}; it is known (8, p. 23) that 2, consists 
of a countable number of sets. By a diagonal process, pick a subsequence 


{ rma } 
such that 


lim Am, (EZ) 


exists for all E € 2». Now let =; be the class of all subsets in 2, for which 
lim Am, (EZ) 
k 
exists. Evidently, 22 C 23 € 2. 
We wish to show that 2; = 2, and will accomplish this by demonstrating 


that 2; isa monotone class (8, p. 27). Let {E,} C 2, and suppose that E, f E. 
Then E — E; | ¢, 80 by condition (2) we have that 


lim Ams (Es) - Ams (EZ) 


uniformly with respect to k = 1, 2,.... But since EZ, € 2s, 
lim Ams (Ex) 
k 
exists fori = 1,2,.... It follows that 
lim Am, (Z) 
k 


exists and hence E € 23. Thus 2; is a monotone class and 2; = 2). 
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We now prove that 
{Ams } 


is a weakly convergent sequence in ca(Z). By the Hahn-Banach theorem 
(8a, p. 19) it is enough to prove that this sequence is weakly convergent in 
the space ca(Z;¥v), and in view of the isometric isomorphism between this 
space and L(S, =, v) it suffices to show that 


{fms 


is a weakly convergent sequence in L(S, Z,v) or even in the subspace 
L(S, 21, v). Since this sequence is bounded it is enough to prove convergence 
on a fundamental set of continuous linear functionals on L(S, 2;, v), such as 
the characteristic functions of subsets of 2. But the convergence on this 
fundamental set is equivalent to the convergence of 


{Ams } 


on 2;, which has already been established. This proves the sufficiency of the 
conditions. 
We shall also need the following criterion: 


1.4. THEOREM. For a set K C ca(2) to be conditionally weakly compact it is 
necessary and sufficient that 
(1) the set K is bounded, and 
(2’) there exists a positive vy € ca(Z) such that 
lim A(Z) = 0, 


»(Z)+0 


uniformly for X€ K. 


Proof. To prove the necessity of (2’), we shall show that for any positive e, 
there is a positive 5(€) and a finite set {Ax,...,A,} GK such that if 
lAd(Z) < 6(e) for i = 1,...,p then |A(E)| < « for \ € K. Suppose that this 
statement is false for some ¢. Let A, € K be arbitrary; then there exists a set 
E,€ 2 and ad2€ K such that 


lAs|(E1) < 27", |A2(E))| > «. 

By induction, construct sequences {A,} C K, and {E,} € = such that 
(*) lA (E,) < 2-%, l<qigj<®, 
Aj (Ey)| > €, l<gj< om. 
We shall suppose that the sequence {A,} converges weakly in ca(Z). Construct 
a positive measure v’ € ca(Z) by Lemma 1.2. By the Vitali-Hahn-Saks 

theorem 

lim A,(E) = 0, uniformly for i = 1,2,.... 


»’(E)0 
From the construction of »’ and (*) we have that »’(E,) < 2.2-’. On the other 
hand, lDu1(E,)| > «, which is a contradiction and establishes the validity of 
the statement. Now let « take on the values 2", m = 1,2,..., and let 6(2-*) 


-- 








ww Ww ‘7. 6 


of 
) 


>. 
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be the corresponding numbers and {d,,/1 << p< P,, 1 <n < ©} CK the 
measures whose existence has been assured. Let » be the positive measure in 
ca(Z) defined by 


(E) = ay + Pel) E€2. 


n=l p=1 P,1 + [Dep| ” 
Then if »(Z) < 6(2-") {2" P,(1 + M)}-', where M = sup |A\|, it is evident that 
AeK 


\Anp| (EZ) < 8(2-") (pb = 1,...,P,) and hence |A(E)| < 2“ for all AE K. 

To prove the sufficiency of the conditions, we observe that (2’) implies 
condition (2) of the preceding theorem, so the proof of the theorem is com- 
plete. The preceding proof establishes the partial converse: 


1.5. CoROLLARY. If K is a conditionally weakly compact subset of ca(Z), 
then there exists a positive v € ca(Z) satisfying (2') and such that v(E) < « 
whenever E € = is such that |\|\(E) < «for allX € K. 


2. Integration with respect to a vector measure. Let % be a real or com- 
plex Banach space, and %* its conjugate space. We shall be concerned with a 
fixed additive set function » defined on a o-field = of subsets of a set S and 
taking values in %. 


2.1. DermniTiIon. The additive set function yp: = — % is called a vector 
measure if x*u € ca(Z) for every x* € ¥*; thus the set functions {x*y/x* € ¥*} 
on = to the scalar field are finite valued and countably additive. 

If uw is a vector measure, we define a non-negative set function ||u|| on 2, 
called the semi-variation of yu, by 


||a||(Z) = sup | an(Ed)|, E€ Z, 


where the supremum is taken over all finite collections of scalars with |a,| < 1 
and all partitions of E into a finite number of disjoint measurable sets. Evi- 
dently ||u||(Z) < ||u||(F) if Z C F; further, 


il( G,) < Eine 


for every sequence of measurable sets {£,}, and the inequality may be strict. 


2.2. Lemma. Jf uw: 2 —& is a vector measure and if |E,} is a sequence of 
disjoint measurable sets, then 


m( dz.) = > (Ed, 


where the series converges unconditionally in the norm of %. The semi-variation of 
pon S is finite; in fact, if E €  , then 


\|ul|(Z) < 4sup{la(F)|| Fe 2, F CE} < &. 
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Proof. The first fact is due to Pettis (11; 12). Given x* € %*, x*y is a finite 


valued measure, so sup{|x*u(F)|| F€ 2, F C E} is finite for each x* € X*. 
“ty the Uniform Boundedness theorem (8a, p. 26), 


sup{|u(F)|| Fe 2, F CE} < o. 
On the other hand, 


llul(Z) = sup |x'ul(E)<4 sup sup |x u(F)I, 
Ix | <1 Ix |< 1 FCE 
so the conclusion follows from the Hahn-Banach theorem (8a, p. 19). 
The next lemma and its corollary are crucial for our purposes. 


2.3. Lemma. The set of numerical measures {x*u|x* € ¥*, |x*| < 1} is 
conditionally weakly compact as a subset of ca(Z). 


Proof. Let {E,} be a decreasing sequence of sets in = with void inter- 
section. Since » is countably additive in the strong topology, lim u(Z,) = 0 in 
this topology. Consequently, lim x*u(Z,) = 0 uniformly for |x*| < 1. By 
Theorem 1.3 the desired result follows. 


2.4. CoROLLARY. If uw is a vector measure, then there is a positive measure 
vy € ca(Z) such that 


lim p(E) = 0, lim ||uz||(Z) = 0. 
¥(E)+0 


*(E)~0 


The measure v may be chosen such that v(E) < « whenever ||u||(E) < «. 


Proof. The first equation follows from Lemma 2.3, Theorem 1.4, and the 
Hahn-Banach theorem; the second follows from the first and Lemma 2.2. The 
final statement is a consequence of Corollary 1.5. 


We now proceed to develop a theory of integration of scalar functions with 
respect to the vector measure yu. A y-null set isaset E € = for which ||y||(Z) =0; 
the term y-almost everywhere refers to the complement of a u-null set. From the 
countable sub-additivity of ||u|| it follows that the union of a countable num- 
ber of u-null sets is also a yu-null set. A scalar function f defined on S is measur- 
able if for every Borel set B of scalars, f-'(B) is an element of 2. A function 
is simple if it is a finite linear combination of characteristic functions of 
measurable sets. 

If f is the simple function 

Do a Xe. E, € &, 


then we define the integral of f over a set E € = by the equation 


J 0) was) = Dam(EN Bd. 


— 7. 
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The integral of a simple function is independent of the representation of the 
function as a linear combination of characteristic functions. Obviously, inte- 
gration of simple functions over E is a linear operation. Also, the integral of a 
simple function is a countably additive set function with values in %. If f is a 
simple function and if E is in = then evidently 


| Sse mas 


If f is an arbitrary measurable function, we define the y-essential supremum 
of f on E to be the infimum of those numbers A for which {s € E| \f(s)| > A} 
is a w-null set. If 





< sup [f(s)]-|lul|(). 


u-ess sup |f(s)|< @, 
then we say that f is u-essentially bounded on the set E € =. 


2.5. DEFINITION. A measurable function f is said to be yu-integrable if 
there exists a sequence {f,} of simple functions such that 
(1) f,(s) f(s) u-almost everywhere, and 


(2) the sequence 
{ f(s) atasy} 


converges in the norm of ¥ for each E € &. 
The limit of this sequence of integrals is defined to be the integral of f with 
respect to uw over the set E € Z, in symbols 


J f(s) w(@s). 


2.6. THEOREM. (a) Jf E€ Land f is u-integrable, the integral of f with respect 
to » over E is an unambiguously defined element of %; 

(b) the integrable functions form a linear space and for E in = the integral 
ff) u(ds) is a linear map of this space into %: 

(c) if f is a measurable function which is y-essentially bounded on E, then f is 
u-integrable and 








J J) meas) 


(d) if f is u-integrable, then f(s) u(ds) is a countable additive function on = 
to X; 
(e) if f is u-integrable, then 


lim lle = 0; 
||u||(Z)-0 JE 


(f) if U is a bounded linear operator from & into a second Banach space 9), then 
Up is a vector measure from = to ¥) and for any y-integrable function f and 


E€ =, we have . 
v\ Js weasyt = f ss) Tutas). 


< [u-ess sup If(s)|} tle |(2)}; 
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Proof. To prove (a), let {f,} and {g,} be two sequences of simple functions 
as in Definition 2.5; we are required to show that the two sequences of integrals 
approach the same limit. We define 4,(s) = 0, if s is a point at which either 
{fa(s)} or {ga(s)} fails to converge to f(s), and set h,(s) = f,(s) — ga(s) 
otherwise. It is evident that {h,} converges to zero everywhere, and that 
{f p/n (s) u(ds) } converges in the norm of ¥ for E € 2. We must show that this 
sequence of integrals converges to the zero element of &. 

Let »v be a positive measure described in Corollary 2.4. Clearly, since each 
h, is a simple function, 
- lim | hy(s) w(ds) = 0, n= 1,2,.. 

Ez 


»(B)30 = 
further, the sequence of integrals {f pita (s) u(ds)} converges for each E€ 2, 
so by the Vitali-Hahn-Saks theorem,? the limit in (*) is uniform in ”. Conse- 


quently, for each « > 0 there exists a 6 = 4(e€) > 0 such that if A € = and 


v(A) < 6 then 
| Stats) mds) 


By Egoroff’s theorem (8, p. 88) there exists a set A € = with »(A) < 6 such 
that {h,(s)} converges to zero uniformly for s € S — A. Having specified « 
and chosen 6 = 6(e) as above, there exists an N = N(e) such that if 2 > N, 
then |h,(s)| < «for s€ S — A. Hence if n > N, 


Gro nas] < | fn gins) atas)| + | fi gha(s) ads 
< «\lall(S) + « 


uniformly for E € 2. Thus the integral is well defined. 

Statements (b) and (c) follow readily while (d) and (e) follow from their 
validity for simple functions, the Vitali-Hahn-Saks theorem, and Corollary 
2.4. The first assertion in (f) follows from Definition 2.1. and the observation 
that y*Up = (U*y*)y, y* € Y*. The second assertion follows from its validity 
for simple functions. 





<6, 3 ee 

















2.7. THEOREM. Let {f,} be a sequence of u-integrable functions which con- 
verges u-almost everywhere to f. Then f is y-integrable if 


lim fa(s) u(ds) = 0 
||u||(Z)0 JE 
uniformly forn = 1,2,.... In this case we have 


Js (ds) = tim f f(s) u(ds). 


*That this theorem remains valid for vector measures may be seen by examining the proof 
in Saks (15), and has been proved directly by Alexiewicz (1, p. 19) from more general con- 
siderations. 


— 


—_— 
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Proof. Let k be a fixed positive integer and let 5, > 0 be such that if E € = 
and ||y||(Z) < 8, then 





(1) | J fats) w(as) <2", n= 1,2,.... 
This implies that if ||u||(Z) < &, then 
(2) J ats) \x"ul(ds) < 16-2, |x| <1, n= 1,2,.... 


Evidently, we may assume that 6, < 2-*. Let % > 0 be such that if A € 2 
and »(A) < m then ||u||(A) < 6,. By Corollary 2.4, the sequence {f,} con- 
verges v-almost everywhere, and by an application of Egoroff's theorem, we 
can select a set A € = such that »(A) < m and such that the convergence of 
{f,} is uniform on S — A. Thus if E € = and n,m > N,, we have 


(3) | fottuls) — fu(s)} m(ds) 








< | fin Wels) — fats} wlas) 


? | | fn(s) w(ds) 


+ Im (s) u(as)| < 2™*{||u||(S)+2}. 





But since k is an arbitrary positive integer, this proves that the sequence 
{f efa(s) u(ds)} converges in the norm of ¥ for any E € 2. 

We now prove that f is u-integrable. Let 5, and m, have the same meaning 
as in the previous paragraph. Since each f/f, is w-integrable it follows from 
Egoroff’s theorem that there is a simple function g, and a set A, © = with 
v(A,x) < m such that 


(4) \fe(s) — gx(s)| < 2-, s€ S— Ay, 
and 
(5) lgx(s)| < 2/fx(s)], sé 5S. 
Let 

B, = U A; 


so that B, € = and {B,} decreases to the set 


B = Nn B,. 
k=l 
Since 


\lm||(Be) < > I\u||(Ax) < aa <drter™, 


i=k 


it follows that ||u||(B) = 0. Now 


f(s) — ge(s)| < If (s) — fa(s)| + fe(s) — ge(s)|. 


If sé S— B, then s € S — B, for k > K(s) and so (4) holds provided that 
k > K(s). Since {f,} was assumed to converge yu-almost everywhere to f, we 
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conclude that the sequence {g,} converges yu-almost everywhere to f/f. It 
remains to show that the integrals {f gfn(S) w(ds)} converge for E € Z. But 


| fiptils) — gx(s)} uds)| < Sash — gx(s)} u(ds) 


. | Srrvai ne Sena »(ds) 


The integral over E — A, is at most 2-*||u||(S) by (4). Since ||u||(Z O\ Az) <&, 


we have seen in (1) that the second term is at most 2~. In addition, from (5) 
and (2), 

















a 














Sena #@s)| = sup | Sena ee aa 
, jx | <1 ; 
< up fang, leeG)l be l(@s) 
Ix | <1 f , e 
< 2. sup lfe(s)| |x u|(ds) < 32-2~*. 
Ix*| <1 ENA, 
Combining these, we conclude that 
6) § tse) - exe) we@s)| < 2*H1 CS) +33). 





Now if & is given, then combining (3) and (6) we see that if m, m > max(k, N;,) 
and E€ &, we have 


| f tents) — en(s)) m(as)| 
< 2-27*{||ul|(S) + 33} + 2*{||u||(S) + 2} 
< M-2*. 
Hence the sequence of integrals of the simple functions g, converges for E € =, 
so that f is u-integrable. 
To prove the last statement, let h,(s) = f(s) — f,(s), m = 1,2,...; then 


each h,, is w-integrable and, by Theorem 2.6(e), equation (*) in the proof of 
that theorem is valid. The argument there applies in this case to prove that 


J. - 40 was] =o, 


which is the desired conclusion. 


We now show that the theorem on dominated convergence is valid for 
vector measures. 





lim 


2.8. THEOREM. If {f,} is a sequence of u-integrable functions which converges 
u-almost everywhere to f and if g is a u-integrable function such that |f,(s)| < g(s) 
p-almost everywhere, n = 1,2,..., then f is u-integrable and 


$ f00) lds) = tion f fu(s) (ds), Ee 2. 
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Proof. By the preceding theorem, we have only to show that 


lim f,(s) w(ds) = 0 
al 


uniformly for » = 1,2,.... Now by Theorem 2.6(e), given « > 0, choose 
6 > 0 such that if E€ &, ||u||(Z) < é then 


J 206) mlas) 








<«. 
Hence if ||u||(Z) < 6 we have 


Jeo lx w|(ds) < 4e, lx"| <1. 


Consequently, ifm = 1,2,..., and ||u||(Z) < 4, then 


| Spful) lds)| < sup frats)| lx" (ds 
Ix | <1 





< sup fee) lel (ds) < 4c, 
Ix | <1 
from which the conclusion follows. 


While the following theorem will not be used in the sequel, it has some in- 
dependent interest. 


2.9. THEOREM. The range of a vector measure uw : = — & is a conditionally 
weakly compact subset of %. 


Proof. We shall prove that if R = {u(Z)| Z€ =} is regarded as a subset 
of ¥** in the natural embedding, then ® is conditionally compact in the weak 
topology of ¥**. Since the embedding of ¥ in ¥** is closed in this topology, the 
statement will follow. Now we have seen in Lemma 2.3 that the mapping 
U :%* — ca(Z) defined by Ux* = x*u is a weakly compact operator; hence 
the adjoint operator U* : ca*(Z) — ¥** is also weakly compact. But the unit 
sphere of ca*(Z) certainly contains the linear functionals {¢, | E € 2} defined 
by (A) = A(E), A € ca(Z). Clearly U*{oe| EE 2} = {w(ZE)| EE ZF}, this 
latter set being regarded in ¥**, and ® is therefore conditionally weakly 
compact. 


3. Representation of operators on continuous functions. Throughout this 
section S will denote a compact Hausdorff space and = its Borel field, i.e., 
the o-field generated by the closed sets of S. The o-field generated by the closed 
G,sets of Sy is called the Baire field of S and is denoted by 2». The Banach 
space of all continuous scalar valued functions on S with the supremum norm 
is written C(S). The Riesz representation theorem (see Kakutani (10)) asserts 
that the conjugate space of C(S) is isometrically isomorphic to either ca( Zp») 
or to the space R(S) of regular Borel measures on S, where the norm in these 
measure spaces is the total variation over S. 











300 R. G. BARTLE, N. DUNFORD AND J. SCHWARTZ 


Let T be a bounded linear operator mapping C(S) into a Banach space %. 
We say that T is compact, or that it is weakly compact, if T maps bounded sets 
of C(S) into subsets of ¥ which are conditionally compact, or which are con- 
ditionally weakly compact, respectively. We shall make frequent use of the 
fact that T is compact (or weakly compact) if and only if its adjoint 7* is 
compact (or weakly compact). In addition, T is weakly compact if and only 
if its second adjoint 7** maps C**(S) into ¥ (more precisely, if the range of 7** 
is contained in the natural embedding of ¥ in ¥**). For the proofs of the above 
facts, see (2; 4). 

Motivated by the Riesz theorem we are led to inquire when an operator 
T : C(S) — % may be represented by an integral with respect to a vector 
measure—we shall see that this is possible if and only if T is weakly compact. 
First, however, it will be convenient to give a representation of the general 
operator. 


3.1. THEOREM. If T is an operator on C(S) to %, there exists a unique set 
function p : > — ¥** such thai 

(a) w(-)x* € R(S) for each x* € X*; 

(b) the mapping x* — y(.)x* of ¥* into R(S) is continuous with the X and 
C(S) topologies in these spaces, respectively; 

(c) x*7f = f f(s) w(ds)x*, f € C(S), x* € ¥*; 

(d) |T| = ||u\|(S), the semi-variation of u over S. 

Conversely, if u is a set function on = to X** satisfying (a) and (6), then equa- 
tion (c) defines an operator T : C(S) — ¥ with norm given by (d), and such that 
T*x* = p(.)x*. 


Proof. For E in = let n(E) = T**(¢g) where ¢, is that element of C**(S) 
defined by the equation ¢g(A) = A(Z) for \ in R(S). Now the proof proceeds 
along standard lines and we omit the details. 

We now ask when we can be assured that the values of u are contained in &. 


3.2. THEOREM. If T is a weakly compact operator from C(S) to %, then there 
exists a unique vector measure yp on the Borel sets = to X such that 

(a) Tf = J f(s) (ds), f € C(S); 

(b) |7| = ||u||(S); 

(c) T*x* = x*p, x* € X*. 

Conversely, if 4 is a vector measure defined on the Baire sets Xo to ¥ and if T is 
defined by (a), then T is a weakly compact eperator from C(S) to ¥ with norm 
given by (b) and adjoint operator given by (c). 


Proof. lf T is weakly compact, then T7** maps C**(S) into ¥ and thus, 
from the definition of y, it is clear that u(£) is in ¥ for every E in &. It follows 
that x*yu is in R(S) and yz is therefore a vector measure. Thus the integral 
f f(s) u(ds) exists for f in C(S). From Theorem 3.1 it follows that x*7 = T*x* 
= x*u. Thus equation (a) follows from Theorem 2.6(f). Conversely, if u : 2» 


— * = 
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is a vector measure and if T is defined by (a), then 7* maps x* into x*y. By 
Lemma 2.3, T* is weakly compact. Consequently, T is weakly compact. 

The next three results were proved by Grothendieck (7) who used other 
methods. * 


3.3. COROLLARY. If T is a weakly compact operator from C(S) to %, then T 
maps weakly fundamental sequences into strongly convergent sequences. Conse- 
quently, T maps conditionally weakly compact subsets of C(S) into conditionally 
strongly compact subsets of %. 


Proof. If {f,} is a weakly fundamental sequence in C(S), then |f,(s)| < M 
for some number M, and fo(s) = lim f,(s) exists for each s € S, although fo 
may not be in C(S). From Theorem 2.8 we conclude that 


Tha = J fals) u(ds)—> f fols) u(ds) 


in the norm of %. This proves the first assertion; the second follows directly. 


3.4. CoroLLaRY. Jf U:}—>C(S) and T: C(S) - are weakly compact 
operators, then TU : 2) — & is strongly compact. 


3.5. THEOREM.‘ If T is an arbitrary operator on C(S) to X%, and if ¥ is weakly 
complete,® then there exists a vector measure yo defined on the Baire sets Xo of S 
with values in % such that 


Th = J f(s) uolds), f € (5). 


Consequently, an arbitrary operator from C(S) to a weakly complete Banach 
space is weakly compact. 


Proof. Let wu: = — ¥** be the set function whose existence was established 
in Theorem 3.1. We shall show that u maps 2» into X. Let f be a bounded func- 
tion in the first Baire class; the reader may readily verify that there exists a 
bounded sequence {f,} in C(.S) which converges pointwise to f. Then {f,} and 
{7f,} are weakly fundamental; hence {7f,} converges weakly to an element 
of %. If f is regarded as an element of C**(S), it is evident that { 7, } converges 
to T**f in the ¥* topology of ¥**. We conclude that 7**f € ¥. By induction, 
T** maps the bounded Baire functions into ¥; in particular, this is true for the 
characteristic functions of Baire sets. Hence u maps Zp» into ¥. Since the Baire 


%Although Grothendieck (7) did not employ the representation for weakly compact operators 
given above, he noted the one-to-one correspondence between weakly compact operators on 
C(S) to ¥ and vector measures on the Baire subsets of S to ¥. Theorem 3.2 can be proved 
using results in (7) but there is some interest in the measure theoretic approach given here. 

‘Gelfand (6) showed that an operator on C[0, 1] to a weakly complete space can be repre- 
sented as an integral with respect to a vector function of bounded variation. The final conclu- 
sion of the present theorem was announced by Pettis (12); it was also proved by Grothendieck. 

5A Banach space X is weakly complete if every weakly fundamental sequence in ¥ converges 
weakly to an element of %. 
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sets are sufficient to integrate continuous functions, we have the representa- 
tion 


Tf = f f(s) nods), f € (5), 


where yo is the restriction of u to Yo. The second assertion follows from the 
converse part of Theorem 3.2. 


3.6. COROLLARY. An arbitrary continuous linear mapping from an abstract 
M-space to an abstract L-space is weakly compact. 


Proof. The terminology is that used by Kakutani (9; 10). We observe that 
the conjugate of an abstract L-space is an abstract M-space with a unit ele- 
ment and hence is isometrically isomorphic to C(.S) for some compact Haus- 
dorff space S (10, pp. 1023, 998). Further, the adjoint of an M-space is an 
L-space and is therefore weakly complete (10, p. 1021; 9, p. 537). Thus 7* 
and T are weakly compact. 


3.7. COROLLARY.* Jf T is a weakly compact map of an abstract M-space 
(or an abstract L-space) into itself, then T? is strongly compact. 


Proof. The second conjugate of an abstract M-space is an abstract M-space 
with unit (10, p. 1023) and hence is isometrically isomorphic to C(S). From 
Corollary 3.4 we conclude that (7**)? and hence J? is strongly compact. 
The second assertion follows from the first and the fact that the conjugate 
of an L-space is an M-space. 


We now turn to the representation of a compact operator on C(S). 


3.8. THEOREM. An operator T : C(S) — & is compact if and only if the vector 
measure uw: & — X corresponding to it (as in Theorem 3.2) takes its values in a 
compact subset of %. 


Proof. if T** is compact, then by the construction of yu, we see that the 
condition is necessary. To see that it is sufficient, it is enough to prove that the 
set R of all sums of the form 


Da u(E,), 


where the {£,} are disjoint and |a,| < 1, is a totally bounded set in %. Let 
¢ > O be given and let M be the semi-variation of u on S. Select a set {8,,...,8,} 
of complex numbers with |8,| < 1 such that if |ja| < 1 then there exists a 
B; = B(a) with |B(a) — al < ¢«/2M. Let {F;,...,F,}C = be such that if 


‘This result was proved for a concrete L-space by Dunford and Pettis (3), and also by 
Phillips (13), by explicitly representing the weakly compact operators. It implies that if an 
abstract M- or L-space contains an infinite dimensional reflexive subspace, then there is no 
bounded projection mapping onto this subspace. 
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E€ =, then there isan F, = F(E) with |u(F(E)) — »(E)| < «/2p. Then from 
the definition of the semi-variation 
| 


FS arn(E) — ¥ ate.) Ed| = | ¥ tar — Bed) 


Further, 





€ 
< ou !@ = te. 


> B(a,) w(E,) 


can be written as a sum 
2 
, B; u(Es), 
j=l 


with {E,'} a disjoint family in =. Thus 


2 6: u(E,’) — 2, 83 B(F(E,)) | < > lw(E,’) — w(F(E/))|< P35 = he. 


We have shown that each element in & can be approximated within « by sums 
of the form 


> B; u(F,,), 


so that & is totally bounded. 


4. Special cases. It is easy (2) to give representations of operators which 
map a Banach space into C(Q), where Q = {t} is a compact Hausdorff space. 
Thus an operator T : C(S) — C(Q) may be studied from two standpoints. We 
now give kernel representations of operators in these spaces. We remark that 
in the case that S = Q = [a, b], the general and compact operator was repre- 
sented by Radon (14), and a representation of the weakly compact operator 
similar to Theorem 4.2 was given by Sirvint (16). In what follows S and Q are 
compact Hausdorff spaces. 


4.1 THEOREM. If T is an arbitrary bounded operator from C(S) to C(Q) then 
T can be represented by the formula 


(*) 


(TH) = JI) Kas,0, FE CS), 1 € 
where K is defined on = X Q to the scalar field and satisfies 
(i) K(.,#) € R(S) for each t € Q; 
(ii) the integral in (*) is in C(Q) for each f € C(S); 
(iii) sup|K(-,2)| = |T| < @. 
teQ 


Conversely, if K satisfies these conditions, the operator defined by (*) maps C(S) 
into C(Q) and has norm given by (iii). 
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4.2. THEOREM. If T is a weakly compact operator from C(S) to C(Q) then T 
can be represented by formula (*), where K satisfies (i) and 
(ii’) K(E,-) € C(Q) for each E€ =; 
(iii’) |K(E, )| < |T| < © for EE 2, t€ Q. 
Conversely if K satisfies these conditions, the operator defined by (*) is a weakly 
compact map of C(S) into C(Q) and has norm given by (iii). 


Proof. It is easily seen from Theorem 3.2 that a weakly compact operator 
must have this form. To prove the converse, let u : 2 — C(Q) be defined by 
u(Z) = K(E,.). Then if {E£,} is a disjoint sequence in ~, it follows from (i) 
that 


K( Uz,t) = >: K(E,9), te Q, 


t=1 i=l 


and from the well-known criterion for weak convergence in C(Q) that x*y is 
countably additive for any x* € C*(Q). With (iii’) this implies that yu is a vector 
measure in the sense of Definition 2.1. 


For some purposes the following representation may be more convenient. 
It is derived readily from the above and the Radon-Nikod¥ym theorem. 


4.3. THeoreM. If T : C(S) — C(Q) is weakly compact, then T can be repre- 
sented by the formula 


(+) (TH) = Ff #05) Rls, 1) v(as), FEC(S), 1 EQ; 


where v is a positive Borel measure on (S, Z) and k is a function from S X Q 
to the scalar field such that 


(a) k(-,t) € L(S, 2, v) for eacht € Q; 
(b) Sx, -) v(ds) € C(Q) for each E € =; 


(c) cup f \k(s, t)| v(ds) = |T| < o. 
teQ 8s 


~ Conversely, if k and v satisfy these conditions then (+-) defines a weakly compact 
operator. 


4.4. THeoreM. If T : C(S)-—+ C(Q) is compact, then T can be represented 
by the formula (+-) where v is a positive Borel measure on (S, =) and k satisfies 
(a), (c) and 


(b’) lim Jee. t) — k(s, to)| »(ds) = 0, to € 0. 


Conversely, if k and v satisfy these conditions then (+-) defines a compact operator. 


ee 





— 
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CONFORMAL MAPS WITH LEAST DISTORTION 
H. G. HELFENSTEIN 


1. Introduction. Our problem is related to the construction of geographical 
maps as follows. The reason for using conformal geographical maps is that the 
scale (viz., the ratio of two corresponding line-elements) does not depend on 
the direction of these line-elements. In an ideal map the scale, being respon- 
sible for the preservation of shape, would also be independent of the points 
where the line-elements begin. Since this is impossible, except for developable 
surfaces, one tries to construct maps whose scales are ‘‘as constant as possible.” 
According to the precise meaning of this expression several “‘best’’ maps are 
possible. Tchebycheff (1) studied the case in which the maximum deviation 
of the scale from a certain constant is minimized. We shall consider here the 
problem of minimizing the mean quadratic deviation of the scale from a 
constant. In order to linearize this problem we have to use, as Tchebycheff did 
also, the logarithm of the scale instead of the scale itself. We prove the exis- 
tence of a best map in this sense for a simply connected domain on an arbi- 
trary surface. In addition we give some explicit formulae for computing it. 

The definition of the mean quadratic error depends on the choice of the para- 
meters on the surface. We use a “‘normal’’ system of isothermic coordinates 
mapping the given surface on the interior of the unit circle of a complex plane. 
Then our problem reduces to that of the best approximation of a given function 
by harmonic functions. 


2. Notation. Let D be a finite, simply connected domain with more than 
one boundary point on a surface S which is sufficiently ‘“‘smooth.”’ Assume that 
we can find an isothermic system of parameters mapping D on a plane schlicht 
domain D’ which in turn can be transformed conformally on the interior of 
the unit circle U of a complex z-plane. If z = x + iy, then x and y are again 
isothermic parameters on D which we call a normal system. It is uniquely 
determined if we let an arbitrary point M of D correspond to the point z = 0 
and an arbitrary direction through M to the direction of the positive x-axis. 
The line-element of D becomes in the x, y system: 

 ® 24 ayy) = lds] 
“ ve) ¥ + ay) = 1G)’ 
where (z) is a real positive function. Every other conformal map of D can be 
obtained as a regular analytic function w = u + iw = f(z) defined in U whose 
derivative must not vanish. In order to fix the image of U with respect to 
congruent transformations we require that 


(1) f(0) = 0, arg f’(0) = 0. 
Received August 16, 1954; in revised form January 28, 1955. 
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We can prescribe also the value of the scale in the fixed point M: 


(2) If @)| = =): 


The distortion in a point x, y now becomes: 
(3) = el. pce) 42! = reve). 


Since f’(z) # 0 and y > 0 we can put 

(4) R log f’(z) = log |f’(z)| = o(x, y), 
log y(z) = ¢(z), 

whereupon (3) takes the form 

(5) m= ¢** 


¢@ being a harmonic function in U. The smoothness of D is now specified as 
follows: The function g(x, y), which is determined by D, shall be at least twice 
differentiable with the derivatives bounded in |z| < 1. 

If m is to be as constant as possible in U then the same must be true for 
@ + ¢. We shall therefore determine a harmonic function ¢ and a constant 
C such that 


(6) [= S Je +e- C)*dx dy 


becomes a minimum subject to the above mentioned side conditions for f(z). 
In particular (2) reads as follows: 


(7) log|f’(0)| = (0) = log =) 


3. Existence theorem. Jf ¢(z) = ¢(R, 0) belongs to the class C™ in |\z| < 1, 
(a) There exists a function o(R, 0) harmonic in |z| < 1 with 


$(0) = log 0) 


and a constant C which minimize the integral (6). 
b) Jf 
¢(R, 0) —C= > d,(R) e™ 
k=—o 
then b, = A, R*, 
Qe 
where Ay=- A+1f f ¢(R, 6)(Re~)*R dR dé, 


and 
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(c) The mapping function subject to conditions (1) and (2) is given by 


oe ht yr. 
f(z) = 50) . exp( 235 Ait ) dt. 


Proof. Introduce polar coordinates by 
x= Rcos#, y=R sin 8, 
and expand in Fourier Series 


+c 1 Qe ss 
(8) g= p> a,(R) ” aa a,(R) = xf. ge *° 0, 
k = 0, +1, +2,..., 
+c 
(9) ¢+e¢—-C=  C(R)e™. 
k=—oo 


Since these functions are real we have 


(10) a. = dh, C,=(C, 

for every k; since they are one-valued we get 

(11) a,(0) = 0, C,(0) =0 

for k ¥ 0, and 

(12) ao(0) = 9(0), Co(0) = (0) + o(0) — C. 


Combining (8) and (9) we obtain 


(13) = D [C(R) — a(R) e™ + C, 


and integrating the condition VY? ¢ = 0 under the side conditions (10)—(12), 
we find, for k > 0, 


(14) C,(R) = A,R* + a,(R), C(R) = C,(R) 
where the sequence of the constants of integration A, (k = 0, 1,2,...) istobe 
determined by our minimum postulate. 


We use now the following identity which is true for every pair of complex 
numbers P and Q: 


P|? + PQ + PQ = |P + Q|? — |QI?. 
Applying it to 


_ A; a ; k+1 
P= 2+)’ Q= V2k+1)f a(R) R dR, 


we obtain from (14): 


: 2 A; 1 _ |2 
as) J lccyrRar = try + V2E+1 J an(R)R aR| 





1 2 1 
a k+1 2 
2(k + 1) J an(R) R dR| + f \a,(R)*|R dR. 
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The completeness relation yields now for the integral (6): 
1 oa 1 
(16) = ar f |\Co(R)|’RdR + 42 >> f |C.(R)|*R dR, 
0 k=1 0 


where the interchange of summation and integration will be justified later. 
Taking into account (15), (16) assumes the form: 


I=2n 











As + v2 f a(R) RaR| — 4r| far) RaR|, 
(17) + anf ai(R) RdR 


— 2(k + o ~ RaR| + f wt 


Here the unknown constants appear only within squares. Consequently J 
assumes its minimum for the following values of the constants: 


1 
(18) A, = — 2(k +1) J a,(R) R**'dR, k=0,1,2,..., 
0 
or, according to (8): 
ar 
(19) inom -2t1 J o(R, 0)(Re-")EtR AR dd. 


Combining (11), (12), and (14) one recognizes that the constant C is also 
completely determined: 


C= ¢(0) — Av =1 ot ttf ig ¢(R, 6) RdR dé. 
From equations (13) <:id (18) one concludes that 


o = log 50) + 20° R* (Ai, cos k0 — Aj’ sin k6), 


where A,’ = RA, and A,” = $A,. 

From this expression one finds the conjugate harmonic function in the usual 
way, taking into account (1) and (2) in order to determine the constants of 
integration. This finally leads to 


_— #7 y k 
(20) f(z) = 7(0) f exp( 25: Ai Dat 
4. Convergence. The convergence of the power series 
(21) > Az" in |z| <1 
1 


follows from known estimates of the Fourier coefficients. If ¢ is h—-1 times 
differentiable with respect to @ and possesses a piecewise continuous derivative 
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of order h with a bound M,, then h-times repeated integration by parts of (8) 
yields: \a,(R)| < M,/k*. 

Combining this with (18) we have 

(22) |A.| < 2 M,/k’. 


For |z| < ro < 1 the series > A,2* is therefore majorized by }>2Moro* which is 
independent of z. The boundness of ¢ is thus seen to be sufficient for the circle 
of convergence of (21) to contain U. 

If h = 2 (as previously assumed) then (21) is also uniformly convergent on 
the boundary of U (which of course cannot prevent the function f(z) from 
possibly having a singularity on the boundary). 

From (14) and (22) with h = 1 one concludes the uniform convergence of 


> 1c. (R)|’, 0<R<1, 
1 


which has been used in the interchange of summation and integration in (16). 


5. Mean value theorem. For every integer k > 0 the mean value of the func- 
tion o(R, 0)R* e*** on the unit circle |\2z| < 1 is numerically equal but opposite in 
sign to the mean value of o(R, @)R* e***. 


Proof. Equation (13) yields: 
C.(R) — a,(R) = 5 foe. 0) e~ "aa, k = 0, +1, +2,... 
hence we have from (14) and (19) 
Ait a - 92% f fe, 0) rte" dr dg = + foe, 6) «aa 
0 Jo 27 Jo 


T 
(k > 0). 


Multiplying this last equation by R**' and integrating with respect to R from 
0 to 1 we obtain the required result. 


6. Determination of the power series of the mapping function. The co- 
efficients of the power series 


23 = —-> C2" 
(23) F@) 7(0) > 
can be determined by differentiating twice both (20) and (23). 
(24) 1) # (6) = exp( 23° Au) = > nC,2"", 
1 1 
and 


2 exp( 25° Aid): » kA," = p> n(n — 1) C,2"*. 
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Substituting the first factor from (24) we get 


( > n or\(¥ k A) = > n(n — 1) C2". 
1 1 1 
Equating corresponding coefficients we obtain 


2>> AnC, = j(j = 1) C;, 


where on the left m and m take the values 1, 2,...7 — 1, their sum being 
always j. Hence 


j-1 
C, eto E of — 9) Ap Ln FMB EQ ccce 


jG — 1) 
Together with C, = 1 this recursion formula determines all the C,'s from the 
A; 's. 


7. An integral representation of log f’(z). Instead of using a series we can 
express our mapping function also by an integral. 


Starting from (20) ” 
lo] 2 pr ) | = 23 Ax’, 


we replace on the right the A,’s by their values (19) and interchange formally 
summation and integration: 


1 or oa 
1] 1 pe )|- -2{ J o(R, 6) 2 (k + 1)(2Re~")*R GR do. 


Putting gq = z R e~* the series on the right is obtained by differentiating the 
geometric series a 
yo g** 


k=l 
with respect to g. Hence 


(25) toe] 2 pe )|- 2, f o(R, |i - a 1 as | RaR ao 


For |z| < ro < 1 and R < 1 we have |q| < ro < 1; consequently the above 
series is uniformly convergent. For |z| < 1 the double integral in (25) is a 
proper integral of a continuous function; for |z| = 1, however, it does not exist 
in general, not even as an improper integral for in this case 
\1 — zRe~*| = |z — Re*|. 
Equation (25) may be transformed in the following way: Let 


=f, (R60) = ¥(R,$). 
Then, since the function ¢ ¥(R, £) is defined continuously for 0 < R < 1 and 
lt] = 1, we can form the expression 


1 6 SHR) 
(26) (Rt) = 555 SE at, 
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where the path of integration is the unit circle of the ¢-plane. For |t| < 1 
this function is ay in t, and we have 


SH(R, 2) ae 
(27) = t) = = (¢—t)? 
Transforming the Phe in (24) and using (25) and (26) we obtain 


(28) le] 1p (z )|- = sf | 23 (R, 0) — ox (R, sR) | RAR. 


One of the integrations is now relegated to the complex integration (26), which 
may be simpler. 

For instance, let us make the further assumption about y, that ¢ ¥(R, f) 
can be extended into the interior of the unit circle of the ¢-plane as a regular 
analytic function of ¢. Then the integration in (26) can be carried out by 
Cauchy’s formula and yields: 


x(R, t) = ty (R, 2). 
Hence we have from (28): 


(29) lo] 1p c) | - sf | ver 0) — ¥(R,2R) — 2R > (R, :R) | RaR. 


It is remarkable that the minimum value of J can be expressed by means of 
the function (26). 
Let 


x(R, t) = > b,(R) t*. 


Then it is easily seen from (17) that 


(30) Tin = 2x} ff cer dR — 2| faw RaR| ; 
+ 4D { J acer dR — 2(k + 1) fn) R“ar|'t . 


8. Eyample. If the function ¢ does not depend on @ the above-mentioned 
condition about the extension of {fy is satisfied. Hence (29) is applicable and 
yields 


7(0) ~ — 
(31) log| 7) f' «)| =0, f(z)= (0) = 


Let now D be a circular domain on a sphere. Using stereographic projection 
from one of the endpoints of the diameter perpendicular to the circle’s plane, 
the line-element, and therefore also the function ¢, obviously do not depend 
on 6. Consequently equation (31) holds, which means (assuming the scale to 
be 1 at the centre): For a circular domain on a sphere the conformal mapping 
with least distortion is its symmetrical stereographic projection. 
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9. Unsolved problems. In conclusion we mention three natural questions 
which we leave unanswered. 


I. We have not studied the problem of the best choice of the point M on the 
surface (“the centre of the map”) whose image is z = 0. Another choice M, 
instead of M would lead to another normal isothermic system 2; = x; + iyi, 
mapping D again on the unit circle. Consequently 

2, = L(z) = g" 28 


a a? 0< 1 < 22, la| < 1, 


where L(z) is a linear function mapping the unit circle onto itself. M, has in 
the old system the coordinates z = L~'(0) = a. For the line-element of D we 
have 





ae = léth _ lds) _ JL'(s)| la 
7(z) ¥1(21) yilL(z)} ’ 


and therefore 
¥i(21) = |L’ (z)|+ (2), ¢i(z:) = log yi(2:1) = log |L’(z)| + ¢(z). 


Substituting this in Inn,(30) we obtain a function of the complex quantity a, 
which we have to minimize by a suitable choice of a. The existence of such a 
minimum (under the side condition |a| < 1) seems difficult to prove in general. 


II. For which regions D will our map become a schlicht domain? A general 
answer to this question is difficult; we can hope that for not too large regions 
which are not too much curved our map will be schlicht, and perhaps more 
precise sufficient conditions can be found. 


III. Conformal mapping in the large: Given two conformally equivalent 
closed surfaces find the mapping with least distortion (in some sense). Since 
the family of all the conformal mappings in this case depends on a finite 
number of parameters our problem reduces to a minimum problem instead of a 
variational problem. But even so the existence of a best map is not obvious. 
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AN APPLICATION OF SOME SPACES OF LORENTZ 
P. G. ROONEY 


1. Introduction. The spaces A(a) and M(a) were defined by Lorentz (2) 
as follows. Let 0 < a < 1,0 <1 < @; let ¢ be measurable on (0, J), and, in 
case / = , let the set where |¢(x)| > « have finite measure for each positive e. 
Define 


I 16(-)Ilae = af x6"(x) de 


where ¢*(x) is the equi-measurable rearrangement of |¢| in decreasing order, 
and 


u [16(-DIlauw = sup (m(E))~* f \4(e)| de, ES (0, 0). 
The spaces A(a) and M (a) consist of those ¢ for which 


I1o(-) | lace < @, 1b (-)! [seca <@ 
respectively. 

Lorentz (2; §5) found, among other things, necessary and sufficient condi- 
tions that a given sequence be the moment sequence of a function in either 
A(a) or M(qa), for / = 1. It is the object of this paper to find necessary and 
sufficient conditions that a function f(s) on s > 0 be the Laplace transform 


of a function in A(a) or M(a) for 1 = ~. To this end we make use of the 
Widder-Post inversion operator, 


11] Ly. df(s)] = "(4) f(#), 


whose theory may be found in (4; chap. VII). 


Section 2 of this paper contains the theory for the spaces A(a), and §3 the 
theory for the spaces M (a). 


Henceforth when / < @, we shall denote the spaces A(a), M(a), L,, over 
(0,2) by A(a,/), and their respective norms by ||¢(-)||ace,». We shall continue 
to denote the spaces A(a) on (0, ~) by A(a@) and the norms by |/¢(-)|| aca. 


2. The space A(a). The first theorem yields some properties of the Laplace 


transform of a function in A(a), while the second theorem is the representation 
theorem. 


THEOREM 1. If ¢€ A(a), and 
f(s) = freeway, 


Received September 21, 1954. This work was done at the Summer Research Institute of the 
Canadian Mathematical Congress. 
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then 
i f s “|f(s)| ds < @. 
N 0 

If $ 1s positive and decreasing, then the above condition is necessary and sufficient 
) that @ € A(a). 


Proof. Suppose ¢ € A(a). Then 
io) ds < fosas free! dt 


- Siew frets as sti-2 folewia 
< a“ T(1 — @)||6(-)||ae < @. 


Conversely, suppose ¢ is positive and decreasing. Then 


Josie as =a 'T(1 — a) ||¢(-)||ac, 
and @€ A(a). 


THEOREM 2. Necessary and sufficient conditions that a function f(s), defined 
for s > 0, be the Laplace transform of a function in A(a) are that 

(1) f has derivatives of all orders in (0, ~) and f®(s)—-0 as s— @ 
(k = 0,1,2,...), 

(2) ||Ze.. L£(s)]Ilacey < WN, where N is independent of k (k = 0,1,2,...). 


“-) 


| Proof of necessity. Let 


f(s) = foe'e@ at, @ € A(a). 


. The necessity of (1) is well known; see (4; chap. 2, §5). 
Now by (4; chap. 7, §6), 
. Ladi) = JK, u) o(u) du, 


where K (t, u) = (R/t)**+! e~™/'(u*/k!). Thus K(t, u) > 0, and 


J "K«, u) du = J°K«, u) dt = 1. 
Hence, by' (3; Theorem 3.8.1), for each a > 0, 
fu f(s)|"dt < fo dt, 
and thus by (3; Theorem 3.4.3), for any a > 0, 
a f Le, di(s)Iat < af rea. 
~ This theorem, like all of Lorentz’s, is stated for the case / = 1. However, all of Lorentz’s 


theorems used here with one exception (to be noted later) are true for / infinite, as a glance at 
} the proof shows. 
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Letting a > ~, we have 


Le. (f(s) ] lacey < |16(-)| laces 
and (2) is necessary. 


Proof of sufficiency. By (2; 3.5(7)), if g(t) is positive and non-increasing 
frremra< Kd fiemia}’, p>. 
0 0 
Let p = 1/a, g(t) = t*-' L, .[ f(s)]*. Then, the above result yields 
UO) on Oa 


co l/a 
< Kin) f o"le Isitar} < Ki,.N'". 


Hence, 
||Zx.-Lf(s)]| | zee) <N’ 
where N’ = K*;,. N. 


Thus, by (4; chap. 1, §17, and chap. 7, §15), ¢ € L(1/a), and an increasing 
unbounded sequence {k,} exist such that 


@) |16(-)Ilzam <2", 
Gi) 46) = fe"ew a 
(iii) for any y € L((1 — a)-"'), 
tim J ¥O Ladslat = fv 0 at 


It remains to be shown that ¢ € A(a). 
But by (3; Theorem 3.6.1), for any ¥ € M (a), 


| © 
| f v(t) La. dis) | < LH C-) | | aac ||Ze, Lf(s)] lace <N ||¥(-)| lure. 


Hence, by (iii), and since, by (2; 1.3(4)), L((1 — a)-") C M(a), for any 
ve L((l — a)-), 





fv smal - tim | 90) La. f@)1dt| <¥ IIe 

Changing y to ¥ sgn ¢, we have for any positive y € L((1 — a)-") 
J VOLO a <N Il¥-)Ilac 

and thus, by (3; Theorem 3.4.2), for any positive y € L((1 — a)-"), 


fv @ (t) dt < N ||¥(-)||mceo. 
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Let y(t) = af™',0 <8 Ct < R, W(t) = 0 otherwise. Then y € L((1 — a)~), 
and ||¥(-)||sca) < 1. Hence 


R 
af fo" (t) dt < N, 
é 
and so, letting 6+ 0, R- @, we have 


l$(-)Ilacy < @, 
and @€ A(a). 


3. The space M(a). The first theorem of this section yields some properties 
of the Laplace transform of a function in M(a), while the second theorem is 
the representation theorem. 


THEOREM 3. If ¢€ M(a), and 
f(s) = Jeo at 


then s* f(s) is bounded for s > 0. If ¢ is positive and decreasing, then the condi- 
tion that s* f(s) be bounded is necessary and sufficient for ¢ € M(a). 


Proof. Let ¢€ M(a). Then if s > 0, by (3; Theorem 3.6.1), 
YO < feo at < [excl] O(c 


= af fe *'dt||6(-) || mca = S*T (a + 1)||6(-) || ece, 
and s* f(s) is bounded. 


Conversely, suppose ¢ is positive and decreasing, and s* f(s) is bounded. 
Let 6 > 0, and 


1 1 
5, <8 <5: 


Then 
t) 3 - 
f o(t) dt < e" f e*'o(t) dt < ef e*' p(t) dt < Ms < M's, 
0 0 0 
so that ||¢(-)||mc) < M’ and ¢€ M(a). 


THEOREM 4. Necessary and sufficient conditions that a function f(s), defined 
for s > 0, be the Laplace transform of a function in M(a) are that 

(1) f has derivatives of all orders in (0, ~), f(s) +Oass— @ (k = 0,1, 
2,...) 


" (2) IIL. [f(s)IIluace <N where N is independent of k (k = 0,1,2,...). 
Proof of necessity. Let 
fs) = feo a, 6 € M(a). 
0 


The necessity of (1) is well known. 
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Now as in Theorem 2, 
1 k k+l oo ptt oo 
Ly. df(s)] = Rl ( . f e* b(u) du = -— f eu o(tu) du. 
!\2 0 k! 0 
Hence, if m(E) = 4, 


| i ahi oo 
5* | |Z, Af(s)]| dt < - f eu‘ du f |p(tu)| dt 
gE . 0 E 


ptt @ 
= 77 f eu t—"du(us)* f | p(t) | dt 
: Jo ul 


where uE = {tt = uv,v € E}, so that m(uE) = um(E). 
Thus 





—a | | .. | 5 —ku. k+a—1 
6 _ ee elf(s)]! at < pr eG) | M(a) € 4&t du 
= ||6(-)||mcI(R + a) /k°T (R). 
Hence, since '(k + a)/k* I'(k) is bounded, we have ||Z,.. [f(s)]||miey < N. 
Proof of sufficiency. It is clear that 
Le. Lf (s) I lacen < ||Le,. (£(5)]| | ace. 
Further, by? (2; Theorem 4), M(a, 1) C L((1 — a’)—', 1) and 
| IL, .[f(s)]| ln«a—e’) Los K,| Ly..[f(s)]| | Mia, Ds 
for every a’, 0 <a’ < a. Let a’ be fixed 0 < a’ < a and let {1,;} be a positive 
increasing unbounded sequence. Then by (4; chap. 1, Theorem 17a), since 
||Lx.-Lf(s)] | |zca-e 1,29 < KiN 


there is a function ¢; € L((1 — a’), J) and an increasing unbounded sequence 
{ka} such that 


1]6(-){|zca-e-2,24,9 < Ki,N 
and 


Him f VO) Len df(sdlat = f ve) arte at 


too 


for every ¥ € L(1/a’,1,). Further, since 


| |Zx...-LF(s)]| |zea—e")-2, 4) < K,N 


there is, by (4; chap. 1, Theorem 17a), a function ¢2€ L((1 — a’), 1.) and 
an increasing unbounded sequence {ki} C {kw} such that 


I]G2(-) | |zea—ey-2,2,) < Ki,N, 
and 


*Lorentz states that this theorem is true for / infinite also. However, this is not the case, as 
it would imply untrue relations between the Lp spaces. 
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ls ls 
lim V(t) Le... Af(s)] dt = f v(t) d2(t) dt 
tro V0 0 
for every ¥ € L(1/a’, 1,). Inductively, since 
||Le, jaa LS (S)) | | z0a—e-2, 49 < K,N 


there is a function ¢, € L((1—a’)-', /,), and an increasing unbounded sequence 
{Ris} Cc {Rs 1} such that 


I]5(-)||zea-e-+,4)9 < KyN 
and 
lim J wo Ly;, Af(s)] dt = f v(t) o(t) dt, 


for every ¥ € L(1/a’, 1,). 

But, if 7 <j’, o,(¢) = ¢y(2) for almost all ¢ in 0 <¢ <1,. For ¢, — oy 
€ L((1 — a’), 1,), and hence if YE L(l/a’,];,) and PJ=y, O<t< l;, 
y = 0,¢ > 1,, then since J € L(1/a’,1y), and {kiy} S {Ry}, 


J, vow. - orma= [vosma- fo oma 
Him J YC) Lun df(s)) at — tim f° Fe) LaweaAlfls)) a 


tim) "YO Law Afton dt — f° 9) Law dfn att = 0 


Thus by (1; chap. IV, §4.2 and Theorem 3), ¢,(¢) = ¢,(t) almost everywhere 
nd<ti<l,;. 


For each t > 0 let o(¢t) = $,(t) where j is the least i such that ¢ < 1,. Then 
clearly ¢ € L((1 — a’), l) for each 1 > 0, and if ky = ky, and y € L(1/a’, )), 


1 I 
tim fv) Ladsnlat = f ve o@ at 


Further, ¢ has a Laplace transform. For if s > 0, then 


e*'sgn(o(t)) € L(1/a’, 1) M Ala) 
and thus by (3; Theorem 3.6.1) 


feel 


| J e*'sgn((t)) o(¢) a 


i 
lim | ff e-rsen(6) La. dstool a 
tow 0 
< |e" | cae) lim sup} |Ly,,.[f(s)]| me < "I (a + 1) N. 


freee dt 


Thus 
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exists for s > 0. Also, 
lim e *'Ly,. [f(s)] dt = f e **o(t) dt, 
too 0 0 


for each s > 0. For, by (3; Theorem 3.6.1.), 
f "Len, Lf (s)] dt | < | (Les. f(s) ] | |arey + J fe" 
t t 


< Naf edt <e« 
i 





and we may also choose / so large that 


f e "| o(t)| dt <«. 
I 
Then, 


lim sup | freoo — Ly, df(s)]}) dt| 


< lim sup | f e *'(o(t) — Le, Af(s)]) dt| + 2e = 2e, 


and thus since ¢« is arbitrary, 


lim e "Ly. Af(s)] dt = f e *' p(t) dt. 
0 0 


tio 


But by (4; chap. 7, Theorem 11b), this last limit is equal to f(s). Thus f(s) 
is the Laplace transform of ¢, and all that remains to be shown is that 
@€ M(a). 

But by (4; chap. 7, Theorem 6a) 


lim Ly, {[f(s)] = $(¢) ae. 


Hence if E is any subset, of measure 5, then from Fatou’s lemma 


f |@(t)| dt < lim int f lLe, Lf (s)]| dt < N& 
E + E 


Hence 


[16()||aa) = sup ~* io) uUeNn 


and ¢€ M(a). 

In conclusion it may be mentioned that results of the type obtained in 
theorems 2 and 4 hold for considerably more general spaces than A(a) and 
M (a). For example, analogues of these theorems hold true if the values of f(s) 


be in a reflexive Banach space; the proof of this fact is much like the proofs 
given here. 





aos 
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ISOMORPHISMS OF FACTORS OF INFINITE TYPE 
R. V. KADISON 


1. Introduction. One of the striking results of the work done by Murray 
and von Neumann (9) in the analysis of rings of operators' on a Hilbert space 
is the reduction of the unitary equivalence problem for certain types of 
factors? to the problem of algebraic equivalence. Roughly speaking, they 
associate with each concrete representation of a factor a number (which 
measures the relative size of the factor and its commutant)—the so-called 
“coupling constant.” Two factors are unitarily equivalent if and only if they 
are algebraically isomorphic and have the same coupling constant. Somewhat 
more precisely, Murray and von Neumann show that the given algebraic iso- 
morphism can be implemented by a unitary transformation. Their results do 
not apply to factors of type III nor does the result concerning the possibility 
of implementation of an isomorphism by a unitary transformation apply to 
the case of II, factors with Il, commutants. Recently* Griffin (4; 7) pointed 
out the surprising fact that (at least in the case of a separable Hilbert space) 
every isomorphism between factors of type III can be implemented by a uni- 
tary transformation. By combining the techniques of Nakano (10) and Segal 
(11) in their multiplicity theory of abelian rings of operators with the global 
ring techniques of Dixmier (1) and Kaplansky (6), and the Dye-Radon- 
Nikodym Theorem (2), Griffin (5) was able to extend the concept of “coupling 
constant” from factors in the separable case to “coupling operator”’ for rings 
of operators on an arbitrary Hilbert space. He thereby extended the unitary 
equivalence results of Murray and von Neumann to rings of operators. How- 
ever, there does not seem to be a description, in the literature, of the possible 
isomorphisms between II,.’s with II, commutants (Griffin’s results are worded 
so as to exclude this case). One knows, for example, that each +-automorphism 
(adjoint-preserving automorphism) of a factor is implemented by a unitary 
transformation of the underlying Hilbert space provided the factor is not of 
type II,, with a II, commutant. What the situation is, in this last case, seems 
to be unknown. This note supplies the missing information concerning iso- 
morphisms between rings of type II,, with I1, commutants. In particular, we 
show that the group of unitarily induced automorphisms of a factor of type II, 
with a II, commutant is a normal subgroup of the group of «-automorphisms 

Received September 27, 1954. This research was carried out during the tenure of a Fulbright 
Grant. 

1A ring of operators is a weakly closed, self-adjoint algebra of operators on a Hilbert space. 

A factor is a ring of operators whose center consists of scalar multiples of its unit element. 

*The author is indebted to E. L. Griffin for having had this result and proof made available 
to him in 1952. 
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and that the quotient group is (canonically) isomorphic to the fundamental 
group‘ of the II, commutant. 

The results of the present note will be employed in a forthcoming account 
of the unitary invariants of representations of arbitrary C*-algebras. 


2. The automorphism group. The first question with which we shall deal- 
the nature of the *-automorphisms of a factor of type II, with a II, commu- 
tant on a separable Hilbert space—is the simplest one, from a technical view- 
point, but, nevertheless, contains all the essential features of the more general 
investigation of the next section. 


THEOREM 1. Jf Mts a factor of type 11, with dimension function D and com- 
mutant I’ of type I1;, then the mapping which takes each *-automorphism @ of 
M into D{(d(E)|/D(E), with E some fixed, finite, non-zero projection in M is a 
group homomorphism of the group, @, of *-automorphisms of M onto the funda- 
mental group of It with kernel, Ul, consisting of those *-automorphisms of M 
which are implemented by unitary transformations of the underlying Hilbert 


space, . 


Proof. Since D-@ serves as a dimension function on I (for each *-auto- 
morphism, ¢, of I), D-¢ is a constant multiple, say a(¢), of D. If E is chosen 
to be a projection in J with D(Z) = 1, then clearly, a(¢) = D[¢@(£)]. Thus, 
if » is another *-automorphism of I, then 

a(n-¢) = D[no(E)] = a(n)D(o(E)] = a(n) -a(9); 

so that @ is a group homomorphism of © into the group of positive reals. We 
examine the kernel of a. Suppose then that a(¢) = 1. Since QP’ is of type 
11, and M of type II., it is possible (8, pp. 178-180) to choose a unit vector x 
in § so that® [Mx] = H. Let F be the orthogonal projection on the space 
[M’x]. Then F lies in M and is finite. Moreover FIR F and M’ F restricted to the 
space [Mt’x] are factors of type II,, one the commutant of the other, with 
coupling constant 1, since x serves as a cyclic vector for both FMF and MF 
(recall that the total space under consideration, at the moment, is [?’x]). By 
assumption on ¢, however, D(F) = D[¢(F)], and, since F is finite, one can 
find a unitary operator, U, in J such that UFU-' = ¢(F). Now [MU*x] 
contains [IU-'Ux] = [Mx] = H, and 


[M’ Ux] = [UM’x] = U[M'x] = U(F(H)) = o(F)(H). 


Thus Ux plays the same role with respect to ¢(F) as x did with respect to F. 
It follows that ¢(F)M¢(F) and M’o(F) are factors of type II,, each the 


‘For the definition of fundamental group of a factor see (9). It should be noted that a factor 
of type II can be viewed as an infinite matrix ring over various factors of type II, all belonging 
to the genus of the II, and, so, all having the same fundamental group which we may call the 
fundamental group of the given II... This group is also the fundamental group of the com- 
mutant. 

’ We denote by [Px] the closed subspace spanned by vectors of the form Ax, with A in M. 
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commutant of the other, with coupling constant 1. Moreover, ¢ restricted to 
FYRF maps this ring isomorphically upon ¢(F)M¢(F). Since FMF as repre- 
sented upon F($) and ¢(F)M¢(F) as represented upon ¢(F)(H) have coupling 
constant 1, the known theory (9) tells us that there is a unitary transformation, 
U,, of F(S) upon ¢(F)(S) which implements the restricted ¢. Now choose 
orthogonal, equivalent projections F,, F,;...in I with sum I and with 
F = F,. Let V, be a partial isometry in 2? with initial space F,() and final 
space F,(H). (Take V,; = F;.) The map U,;, discussed above, transforms 
F,() onto ¢(F;)() and implements ¢ restricted to F;IF;. Define U, to be 
o(V,)UiV,*. Clearly, U, is a unitary transformation of F,() onto $(F,)(). 
We assert that U, implements the isomorphism ¢ restricted to F,INF,. In fact, 


U,F,AF,U,—* = @(V_)U1( Va" FA Fy Vn) U1" (Vn)* 
= $(V,)Ui(FiVi* FA F,V,F1) Ui-'$( V2") 
= $(V,)6(FiV,* FA FV Fi)b(V2") 
= $(V,FiV,"F,AF,VaFi Va") = 6(F,AF,). 


The transformation U defined to be U, on each of the spaces F,,() is a unitary 
transformation of § onto § and certainly implements ¢ on each of the rings 
F,INF,. Moreover, 


UV,U- = UV,.(Xe Ur) = UV,U = (Le Ud) VU = U,V, 
= (¢(Va)U: Va") VU = (Vq) Ui FU = 6(Vn)6(Fi) 
$(VaFi) = o(V,). 


Thus 


UAU- = U(Xin Fr)A (Som Fm) U-' = U(Xonm FnA Fm) U- 
>... UF,A F,,U-! 

= Dam UV_V_*A Vin Vin* U- 

= Diam (UV,U-") (UF V,*A Va Fi U-') (UV,,* U-") 

= Diam O(Vn)O(Vn*A VinO(Vin*) = Don O(Fn)O(A)O( Fm) 
= (Ln O(F,))6(A) (Lim O(Fm)) = OA), 


since each *-automorphismis countably additive We have established that 
each automorphism, ¢, in the kernel of a is induced by a unitary trans- 
formation of 5. Suppose, on the other hand, that ¢ is an automorphism of M 
induced by the unitary transformation U of §. Once again, choosing x a unit 
vector in § such that [Mx] = § and defining F to be the orthogonal projec- 
tion with range [9t’x], we have that F lies in Mt and is finite. In addition, 


[MUx] = (UU-' MUx] = U[Mx] = U(H) = §, 


and 

[MY Ux] = U[U" DY Ux] = U[M'x] = UFU-" U(H) = o(F)(SH). 
Now it follows (8, pp. 178-180) that [Mx] is equivalent to [P’ Ux] modulo M, 
i.e., D(F) = D[¢(F)], since [Mx] = Hisequivalent to [M Ux] = Hmodulo M'’, 
so that a(¢) = 1; and ¢ lies in the kernel of a. Thus we have identified the 
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kernel of a with the group U of unitarily induced automorphisms of I. Finally 
we show that the image of a is precisely the fundamental group of MM. In fact, 
with F a projection of relative dimension 1 in It, FMF is a factor of type I], 
and $(F)M¢(F) is its Di¢@(F)] = a(¢)th power‘ (for, if, say a(¢) < 1, then 
o(F)M¢(F) is unitarily equivalent to the restriction of FIF to any projection 
in FY of relative dimension a(¢)). However, ¢ induces an isomorphism of 
FMF upon ¢(F)M¢(F), so that a(¢@) lies in the fundamental group of FMF 
(which is the fundamental group of 2). Suppose that a is in the fundamental 
group of 9. Choose (9) two (infinite, sets of matrix units [Z,,]; ,.1.2,... and 
[Fislijmt.2,... in M, with E,», Fa» projections of relative dimensions 1 and a, 
respectively, for each n. If we denote by N; and ®, the sets of elements in M 
which commute with [E,,] and [F,,], respectively, then 2, and MN. are sub- 
factors of M of type I1,, and are isomorphic to E,,MELy, and FyMFin, respec- 
tively. Thus M, is the ath power of 9, and, since a lies in the fundamental 
group of M (hence, of N,), there is an isomorphism 7 of N, onto MN». Now M 
is isomorphic to the denumerably infinite matrix rings over N,; and over N, 
in which only those matrices occur which yield bounded operators on the 
denumerably infinite direct sum of § with itself. Let ¢:, ¢2 be these iso- 
morphisms and Jt,°, N.° the matrix rings, respectively. If A° is a denumerably 
infinite matrix over J, (or NM.) it will act as a bounded operator if and only if 
the bounds of the operators obtained from A® by replacing the entries whose 
row or column index exceeds m by 0, forms a bounded set of numbers. Now 9 
extends, in the obvious way, to a *-isomorphism 7, of the m X m matrix ring 
over Jt; onto the m X m matrix ring over N». Then », is norm preserving and it 
follows from the foregoing characterization of the operators in 9,°, R.° that 
n°, the extension of » to 91°, is a *-isomorphism of 9,° onto M.°. Under ¢; 
and ¢2, respectively, E,, and F\,, respectively, map onto the matrices in %,° 
and §,°, respectively, whose entry in the first column and row is I and whose 
other entries are 0. Thus the *-automorphism, ¢:~'n° ¢:, of M carries Ey, 
onto F),;. It follows that a(¢2~' 7° ¢:) = a, so that the homomorphism a maps 
onto the fundamental group of MM. 


COROLLARY. There exist factors of type I1,, with 11, commutants which admit 
non-unitarily induced automorphisms. 


Proof. Since the fundamental group of the approximately finite II, is the 
multiplicative group of positive reals, the automorphism group of the approxi- 
mately finite II, (bounded, denumerably infinite matrices over the approxi- 
mately finite I1,) contains distinct cosets modulo the group of unitarily 
induced automorphisms corresponding to each positive real number. The 
denumerably infinite matrix representation, acting in the usual way on the 
direct sum of Hilbert space with itself a denumerably infinite number of times 
has a II, commutant and provides the desired example. 

It is a rather surprising observation that, in a certain sense, the more 
complicated factors of type II, have the less complicated automorphism groups. 
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Indeed, one tends to think of a factor of type II,, whose fundamental group 
consists of 1 alone as being quite complicated structurally (the approximately 
finite factors of type II,, would appear to be the least complicated), while, 
in this case, all s-automorphisms are unitarily induced. On the other hand, the 
only information we have about the fundamental group of any factor is that 
the fundamental group of an approximately finite factor of type II, is the group 
of positive reals. It may well be that this is the fundamental group of all 
factors of type II,. It would be quite interesting to know, for example, the 
fundamental group of the factor of type II, which is the group algebra of the 
free group on two generators. 


3. The linking operator of an isomorphism. In this section, we shall deal 
with the more general situation of *-isomorphisms between rings of type II, 
with Il, commutants on arbitrary Hilbert spaces. 


DEFINITION. Jf @ is a *-isomorphism between two rings of operators MN 
and IM, of type I1,, with 11, commutants acting on the Hilbert spaces $, and Hz, 
respectively, we shall call the operator D|¢(E)| in the center of M2, the linking 
operator for o, where E is the projection in I, with range [Mii'x], where x is 
a unit vector such that [t,x] = 91, and where D is the center-valued dimension 
function on IN. normalized so that D(F) = 1. (F defined for M+ in the same way 
as E is defined for M1; assuming M1, Ms have countably decomposable centers. 


Several remarks are appropriate with regard to this definition. In the first 
place, the cyclic vector x exists since J, is of type II,, and Mt,’ is of type I1,. 
Secondly, E is finite with central carrier the identity, and any other projection 
in MM, arising from a cyclic vector such as x is equivalent to E. Since ¢ maps 
finite projections into finite projections and equivalent projections into equi- 
valent projections, D[¢@(£)] is independent of the choice of x and is a positive 
operator in the center of Dt. (observe that ¢(£) is finite and has central carrier 
I). Using direct sums, we assume our rings have countably decomposable centres. 


THEOREM 2. If ¢ is a *-isomorphism of the ring of operators IN; of type I, 
onto the ring of operators IN, and the commutants of IM, and M, are of type 111, 
then is implemented by a unitary transformation of 9, onto D2, the Hilbert 
spaces upon which M, and M2, respectively, act, if and only if the linking operator 
for is the identity operator. 


Proof. Suppose first that ¢ is implemented by a unitary transformation U 
of ; onto He. In this case, with the notation of the above definition, 
[M2 Ux] = U[U-' Mz Ux] = U[M:x] = U(H:) = Hz, 
and 
[M.”’ Ux] = U[U-' M,’ Ux] = U[My’ x] = U(E(H:)) = o(E)(H:2). 
Thus D[¢(E)] = I, for ¢(£) arises from a cyclic vector Ux, and is therefore 
equivalent to the normalizing projection F for D, in Me. 
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We assume now that D[¢(E)] = I, so that ¢(Z) is equivalent to the nor- 
malizing projection F for D and arises from a cyclic vector for Pty. Thus 
¢(E)M2(E) and M,.’ o(£) are of type II, with a joint cyclic vector, as are 
EM, E and M;,’ E. Moreover, ¢ yields a *-isomorphism of E Mt, E onto 
¢(£) M2 o(£). This last *-isomorphism is implemented (3; 5) by a unitary 
transformation U, of E(G,) onto ¢(E)(§2). Again, as in Theorem 1, one can 
find a (possibly uncountable) family of projections [Z,] in Qi, mutually 
orthogonal and equivalent to Z, with sum I, and with E as one of the projec- 
tions of the family. Let V, be a partial isometry in 2, with initial space 
E(§;) and final space E,(§;) (let E be the partial isometry with initial and 
final space E($,)). Define U, to be ¢(V.)U; V.*. As in Theorem 1, the unitary 
operator U defined as U, on E,(§:), for each a, implements ¢ on each of the 
rings E, Mt, E, and UV, U-' = ¢(V,). Thus the isomorphisms A — UAU-" 
and A — ¢(A) agree on a subset of I, dense in the strongest topology. Since 
both mappings are strongestly continuous (3; 5), they agree on Dt; and ¢ 
is implemented by the unitary transformation U. 

It is now a simple matter to incorporate the above result into the 
statements of Griffin (3; 5) to completely answer the question of when 
*-isomorphisms between arbitrary rings of operators on arbitrary Hilbert 
spaces are induced by unitary transformations. 
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REDUCIBLE DIOPHANTINE EQUATIONS AND 
THEIR PARAMETRIC REPRESENTATIONS 


E. ROSENTHALL 


1. Reducible diophantine equations. The present paper will provide a 
general method for obtaining the complete parametric representation for the 
rational integer solutions of the multiplicative diophantine equation 


@ Pp @ 


1.1 I] I] ; [fas (2151, tee Xejis) - I] I] (es(¥i5% tee Yuji) 


(k) tml jm (kK) i=l jul 


for some specified range of k, where the a;,, 6; are non-negative integers 
and the f;:, 4,; are decomposable forms, that is to say they are integral irredu- 
cible homogeneous polynomials over the rational field R of degree k in k 
variables which can be written as the product of & linear forms. 

Equations of the form 1.1 appear often in diophantine problems and many 
results concerning their parametric solutions are known. An account of some 
of these has been given by Skolem (6, pp. 64-69) where the most general 
equation of type 1.1 considered is 


1.2 S (x1, Xe, ..- Xn) = hyS'ys’. . . 95”, 

f being a decomposable form of degree n, and we note that 1.1 becomes 1.2 
when we restrict the ai, 6; so that di, = 1; a4 = 0 when i # 1, j ¥ 1, 
k 41; bin = €;; bin = Owheni ~ 1,k #1. 

The method illustrated there for the complete resolution of 1.2 uses ideal 
theory in the ring of integers of the algebraic number field K in which f splits, 
i.e. the field in which f can be written as the product of linear factors. In this 
process, by solving a simple multiplicative equation (3, p. 87) in the rational 


domain, the resolution of 1.2 is reduced to finding all x1, x2,...,x, and 
U1, U2,..., Um Satisfying an equation of the form 

1.3 N (wyxi + wore +... + wart,) = Dul'ud’... ube 

with the g.c.d. condition (x;, x2, ...,%X,) = 1 where wy, wo, ... , @, is a minimal 


basis of K and N(T) denotes the norm of the algebraic number T. The resolu- 
tion of 1.3 is then obtained by factoring each u, into prime ideals of all permis- 
sible degrees and then these prime ideals must be distributed among the linear 
factors of the left hand member in all possible ways so that these factors are 
conjugates and the equation is satisfied. No systematic method is provided 
for making this distribution and although the complete resolution of 1.2 is 
thus not too difficult in principle the above procedure when applied to 1.2 
itself is unwieldy and is certainly unserviceable for the complete resolution of 
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1.1. In view of these circumstances there appears a need for a straightforward 
method to handle with more facility the general completely reducible equation 
1.1. 

In this paper we shall consider 1.1 from a different point of view. If f is a 
decomposable form of degree m then it is well known (1, pp. 378-383) that a 
certain integral multiple of f or a form equivalent to f can be expressed as 
N (a; %1 + a2 Xo +... + _ X_) Where a, ae, . . . a, are integers of an algebraic 
number field of degree n. It follows that equation 1.1 can be replaced by an 
equation of the form 


¢ Pp 
1.4 al] I] I] [LN (cern igs H+.  . H Otee Xess) |" 
7 & 


(k) t=1 
a P 

- oT] I] IT [N (Bre 15¢ Bee dess))"" 
where a,x;, Bx; are algebraic integers of degree k and a, b are rational integers. 
The resolution of 1.1 thus reduces to solving a multiplicative equation in which 
each member is a product of linear forms in the normal algebraic extension 
E/R where E is generated by adjoining to R all the a,.;, 8. and their con- 
jugates. 

The general equation of type 1.4 associated in this form with a given normal 
extension is formulated below in 2.1 and in the development of our method 
for the complete resolution of 2.1 it is the Galois group G of the normal exten- 
sion E which plays a dominant role, and the method permits the equations to 
be classified according to their group G. We shall show that there is a corre- 
spondence between the solutions of 2.1 and the complete parametric represen- 
tations in multiplicative form of a multiplicative system of independent 
equations in the rational domain. From a knowledge of these parametric 
representations, which can be obtained by the method of Bell (2) or Ward (6), 
the complete solution of 2.1 can be written down. The passage from 2.1 to the 
corresponding multiplicative system in the rational domain and the return 
from the solutions of this system to the solutions of 2.1 only require the simple 
and straightforward computation in G as prescribed in Lemma 1. This is 
established by Theorem 2 and the algorithm in §5. 

In an earlier paper the author established Theorem 2 for the case where E 
is the cyclotomic number field (4, p. 219). Although not mentioned in that 
paper it is readily seen that the same proof shows that this theorem, as stated 
there, holds for any cyclic field. It was then natural to inquire whether or not 
a theorem of the same type could be obtained when the Galois group of E is not 
necessarily cyclic, and to answer this question has led to the present investiga- 
tion. 


2. Notations and conjugate elements. We now introduce some notations 
and definitions and state a few well-known properties of conjugate elements 
which are needed in this paper. Let E be a normal algebraic extension of 
degree m over R and let G be its Galois group; the subgroups of G will be 
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represented by g, and o(g) stands for the order of g. G can be partitioned into 
complete conjugate classes S,, S:,...,5, and g, will denote a representative 
group from the class S,;. All the small Latin letters will represent rational 
integers and large Latin letters, unless specified otherwise, will denote ideals 
in the ring of integers of E. The Greek letters ¢, 7,» are reserved for the 
substitutions of G. If a is an element of E then (a), called a conjugate of a, is 
the image of a under the substitution «. By «JT we mean the ideal obtained 
from T when we replace each integer a in T by o(a); the ideals oT are called 
the conjugates of T. All the substitutions of G which leave T unaltered form 
a group g; we then say that T belongs to g. All the substitutions of G which 
transform T into a given conjugate element rT form a left coset rg of g. It 
then follows that all the distinct conjugates of T are given without repetition 
by 7:7, 727, ...,7,T where 11, 72, ..., 7, form a complete set of left residues 
of G modulo g. Also if T belongs to g then the conjugate ideal oT belongs to 
ogo—'; thus conjugate ideals belong to conjugate subgroups and therefore the 
only ideals belonging to the set S;, i.e. to any group of S;, are the conjugates 
of ideals belonging to g,;. The letter P will be reserved for prime ideals. T,, 
will signify that the ideal 7, belongs to the group g;. 

If ru, Tea, «+ - Tegcy be a complete set of left residues of G modulo g, then 
with each permissible Galois group G we can associate the multiplicative 
equation 


h t a t 


2.1 I] ; [7 aX sa7 aX i.e ee Tig Xia) = I] I] [raYir ee tio Vee) 


k=1 t= be 


where a4, by are non-negative integers. 

Theorem 2 provides a method for the resolution of 2.1 yielding the solution 
in multiplicative form. If further we select the ideal parameters in this solution 
of 2.1 so that the ideals X, and Y;, are principal we obtain the complete 
solution of 1.4 in rational integers. 


3. Lemmas. The following lemmas are required. 


LEMMA 1. Form the following array containing all the left residues of G modulo 


g3: 
Vi1,ig Vi2,ig + + « V1 afl), ts 
V21,49 V22,ig + + + V1 a(2), i 
Vs(i,p1,t7 + + + Ys(t, pals), i7 


where v1.4; 1s a left residue modulo g, not in the first e — 1 rows and the elements 
of the eth row are all the distinct left residues modulo g, of the elements of the right 
Cosel 24 Ve1-i3- 
Then if A is an ideal belonging to g, all the distinct conjugates of A are given 
without repetition by 
Ver-ig A 


fone=1,2,...,s(4,f);f = 1,2,...,a(e). 
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This follows since the array contains all the left residues of G modulo g, 
and no residue appears twice. 
In §6 we shall refer to the above array as the v(i, j) array for G. 


LEMMA 2. Let A be unaltered by g, and let B be the product of all prime ideals 
belonging to the conjugate set S, which divide A. Then B can be expressed in the 


form 
e s(i,7) a(e) Cre 
I] I] I] Ves, igh 5.1 ’ 
r=1 e=l f=l 
where the v are as described in Lemma | and the c,, are non-negative integers. 


Proof. Since the only primes belonging to the conjugate set S, are the con- 
jugates of ideals belonging to g, then by Lemma 1, B must be of the form 


r s(t. 7) ale) Gres 
B= T(E fours.) 


r= e=l f=l 


But B is the product of all primes belonging to the same conjugate set which 
divide an ideal which is unaltered by g; and so B is also unaltered by g,. It 
follows then that if »,1.,;P,,, divides B so does ov,;,,,P;,, for each o € gy. 
However not all these divisors are distinct. The distinct images of P,, by the 
substitutions of the coset g;v,1,,, are those obtained by the distinct left resi- 
dues of g;¥.:,4; modulo g,, i.e. by the substitutions v1.45, 62,44 - - « » Veale). ty 
Hence d,.) = drop = .. . = Areate) = Cre, SAY Since by Lemma | the conjugates 
Ver.1y Ps,, are all distinct. 


LemMA 3. Any left residue of G modulo g, which appears in the coset rg, occurs 
there with multiplicity equal to o(g;(\ g;). 


Proof. lf is in the intersection of the cosets rg, and og, then these cosets 
both contain ov for all » € g;\g,, and these are the only elements in their 
intersection. 


LemMA 4. Let 71, 72,...,7_ be a complete set of left residues of G modulo g, 
and let o1,02,...,0 be a complete set of left residues of g, modulo g,. Then 
among the products t,0, (r = 1,...,@; s =1,...,6) appear all the left 


residues of G modulo g, and each residue occurs with multiplicity o(g,)/o(g:(\g,)- 


Proof. The cosets r,g; (r = 1,2,...,a@) contain all the left residues of 
G modulo g, each with multiplicity o(g,). In each coset r, g; a given left residue 
modulo g; which appears there occurs, by Lemma 3, with multiplicity 
o(g,\g,). Hence a given left residue of Gmodulog, must appear in 
o(g,)/o(g:(\ g,) of the cosets r,g;. Since all the residues ro,..., 70» for 
given r are distinct modulo g, and they are all the distinct left residues of rg, 
the theorem follows. 


Lemma 5. Let r1,..., 7, be a complete set of left residues of G modulo g, and 
let ow, ..., ov for o © g, be a complete set of left residues of gw modulo g,. 
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Then among the products r,¢0, (r = 1,...,a@; s = 1,...,6) appear all the 
left residues of G modulo vg, v~' each appearing with multiplicity 
o(g;)/o(g, C\ vg, vr"). 


This follows from Lemma 4, since o;, o2,...,¢, is a complete set of left 
residues of g, modulo vg, ». 


4. Fundamental theorem. We introduce some additional notation. The 
notation A ;. denotes that ideal A, is the product of all ideals belonging to 
the conjugate set S, which divide an ideal unaltered by g;, and ry, . . . , Tico 
denotes a complete set of left residues of G modulo g,; y(m, ij) will denote the 
quotient 

0(g5)/0(gs OV’ ¥mi.i5 By Vm, 49) 
where the » are as prescribed in Lemma 1; also s(i, 7), which we shall denote 
by s in this section, is as defined in Lemma 1. 


THEOsEM 1. All solutions of 


rh t t 

4.1 I] I] (rTuA jiih ++ TioAj:u) = I IT ¢ (raBy-u see roB;:a)"° 
k=1 i=l = i=1 

are given by 


4.2 Aja = 8 1 (v2y""",, Bye = TL TD 


e=l q=1 em] 
where 
r(e) + . 
V je = Wer.ts] j.¢+ + + Meale)l j¢ 
and 
m , Mos. ix; 2 naan (g=1,2,...,w) 
ql,iky ae | qs,iks ql, iks » qs,itk q +=) ’ 


are all the distinct primitive solutions x, y of the linear system 


h 


4.3 p>: > > care, ij) Xen = 9 > S bavi (€, ij) Ye. w- 


=l i= i=l e=l 


Proof. From Lemma 2 it is seen that A ,.y is of the form 


4.4 Aja = I] I] (ii Per.09P's : 


and there is a similar expression for B,.y with c,,,4 replaced by d,,. 4. Sub- 
stituting 4.4 in 4.1 gives 


s ale) o(i) 


I Il I Il I] I] (7 re7.03P 5.7)" 


i=m1 r=—l e=l f=l d=] 
o s ale) g(t) 


= IT [] I] IT I] I] @ wes. ish’ j.2) * mee 


k=1 i=l f= = 


which by Lemma 5 becomes 
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h t r 


45 I] II I] I [rPs7--- tue? Or oo 


k=l i=l rel e=l 
id 


h t sf 
I] IT I] I] [raP;.s tee Ti ph; ” ae ——) 


k=l i=l rel e=l 
and this is equivalent to 


a h t e 


IT I I [raP,, Pare Ti pP 5.2] r Po = |] I] I] [raP,, reese TeipPy came 
k=l i=l rel bak ‘ted gon 
where 


Cae Dd Cre. wv (e, ij), d;.u = D dv «ey (e, ij). 
e=1 


e=l 


This relationship is satisfied if and only if 


h 


4.6 Y Daacne = Y 3 buds a Se | 


=—l i= i=1 


This is a linear system of equations to be solved for the non-negative 
integers Cr. «u, dre, For fixed r let the w primitive solutions (5; p. 9) be 


Moi, tks re | Mos. ik; Moliiky +++ Nas, tk 
for «= 1,2,...,¢#; R=1,2,...,h4 and g=1,2,...,w. Then all non- 


negative solutions of 4.6 are given by 


wv 
4.7 Creu = > UgrM ge. tks dre, > Ur ae, ik 
q=l 


q=l 


for integer parameters u,,. Substituting 4.7 in 4.4 and putting 


IT (P;.7)"" = T5¢ 


we get 
A §;; a> nt I i (Vey, ig] 4. ” ioe “ B;, _ ee -T] IT | I] (Vey, ij] j.r) r yn ~ 
q= = [= q= = 


which we can write as 
T a” asd r(2)\ mes.ik r(8)\ Moe. 
Aja = 8 ae “ae... 
and similar expression for B,. with m,;, replaced by m,;, «. 


THEOREM 2. Let the complete solution in multiplicative form of the equation 


h t 8 h t s 
vle,ifaie yle, i)die 
4.8 TT TT Ty a" = TT TT I 
k=l t=1 e=l k=l tel e=l 


in the rational domain be 


w w 
4.9 Xen = I] c"", %.0° [] . “Eos er" 


q=1 q=1 
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Then the complete solution of (4.1) is given by 


s . 
A j:% = I] Xe. iks Bj: = I] Ve. iks 
e=1 


e=1 


where Xin» Ve, are as stated in 4.9 but with t, replaced by 
gr = Ye1.ig] j.¢ “+ Veate).1j1 4 ¢ 


Proof. lf 4.9 is the complete solution in multiplicative form of 4.8 then 
Mee, ix ANA Noe are the primitive solutions of equation 4.3 (6, p. 70). Thus the 
values of A ;. and B;., in 4.1 are precisely the relations 4.2. 


5. The algorithm. We can now furnish a procedure for the resolution of the 
ideal equation 


a t 
5.1 I] I] (rX at ieX te... Tig oXin) t 


I] (raYiati Vix see tioV 2)" 


h 
k=1 i=l 


With A ,. as defined in §4 we can write 


t t 
5.2 Xin = I] Asa, Yin = I] B i: 
I= i= 


Substituting 5.2 into 5.1, and since a given prime ideal belongs to only one 
conjugate set, we can equate the product of primes belonging to the same 
conjugate set and we get the system 


h t 
5.3 I] I] (TaA 5:7 2A 5% see Tigi nA ia)" 


k=1 i=l 
a t 


= I] (raBy-at eBy:x--- Ti0B;:a)" fi?) eer | 
k=1 i=l 
Each equation of this system is of the type 4.1 and can be solved by the 
method given there. Substituting the expressions found in this way for 
A ;., Bj. into 5.2 gives the complete solution in multiplicative form of the 
ideal equation 5.1. 


Remark. Theorem 2 was stated and proved only for the case when 4.1 and 
hence the associated system 4.8 consists of two equal products. However the 
theorem also holds when 4.1 consists of a finite number of equal products. 
This follows since Ward's result on the correspondence of the solutions of 
additive and multiplicative equations holds for multiplicative systems with a 
finite number of equal products. 

It will also be seen from the example considered in §6 that the number of 
parameters in the complete solution of 5.1 obtained here can be reduced. No 
general results are given here which will yield the solution in terms of a mini- 
mum number of parameters. 





ie 
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6. Example. For purposes of illustrating the algorithm we shall consider a 
very simple equation, one which splits in a field with group G (1, ¢) where 
a? = 1. Representatives from each of the complete conjugate sets are g; = G, 
£3 = 1. 

Let us consider the problem of solving completely the ideal equation which 
arises in the complete resolution in rational integers of the equation x*+ay? = 2", 
namely, 


6.1 Xo1.0X91 -_ Yiu. 
Putting 
6.2 Xo = Ai, Aon and Yin = Bias Bon, 


then for equation (6.1) the relations (5.3) become 

A 4.2.0A i = Bn Gj =1, 2), 
and these equations must be solved for each j. For this purpose we require the 
v(1, 7) and »(2, 7) sets as defined in Lemma 1 and the corresponding multiplica- 
tive equation in the rational domain with its parametric solution. This data is 


listed below for 7 = 1,2 and the corresponding values of A j.2:, By.1, are at 
once written down. 


J=1. man =1;- 50,1) = 170,11) = 1. 
Yu2 = i: s(2, 1) == 1, y(1, 21) = 2. 


xin = Yiu X12 = 4, V1.1 = ti. 
Aina = Tia Bia = Tih. 
j=2. mix =1, v0.12 = o; s(1,2) = 1, y(1,12) =1 
V1.2 = 1 
¥21,22 = @; s(2, 2)= 2, y(1, 22) = 1, y(2, 22) = 1 
X1,21%2,01 = Yiu > X12 = eS 
Xeon = tels.. . far, 


U1 = byte eee ba+1- 
Az: = T3:Tts - a T2.n-0(T 2,272 re eee 
Bony = T21T2,2.. . T2,n41-0(T 2,17 2,2 . . « T2041). 


Substituting these expression for A j.2:, B j:1; into (6.2) yields the following for 
the complete solution of (6.1), 


Xo = Tia(T2iT22'.. . T2.n).0(T 2,272.3... T2.n+1), 
Yi Ti.1(T 2,17 2,2 eee T2.n41)-6(T2,1T 2,2 —yie 


In conclusion we list the result obtained when the method of this paper is 
applied to an equation which splits in a field whose group G is the group of 
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symmetries of the square. G is then a group of order eight generated by 
o, 7 where r? = of = 1, ro = o*7, ra? = o*r, ro? = or. Representatives from 
each of the complete conjugate sets can be take.1 tobe g; = G,ge = (1, ¢, 0°, ¢*), 
g3 = (1, 0, r, or), gs = (1, 0’, or, o*r), gs = (1, 07), go = (1,7), 7 = (1, e7), 
gs = Tf. 


Then by the method of this paper the complete solution of the ideal equation 
6.3 X61 . aXe. . o7X61 ° o* X61 = Ue. . tU2, 
is found to be 


1+rTar yl+ 
X61 Noy! 3,2Ve1 
r 3 ry r *+e3 
U21 = N23 N 32 NV ead se ’ 
where the notation 7%**’* is used for ¢7*. 77°. Equation 6.3 arises, for ex- 
ample, in the complete resolution in rational integers of the equation 


N(a + 50 + cé? + dé*) = u? + 0? 
where 6 = ‘\/a. 
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AN INHOMOGENEOUS MINIMUM FOR NON-CONVEX 
STAR-REGIONS WITH HEXAGONAL SYMMETRY 


R. P. BAMBAH anv K. ROGERS 


1. Introduction. Several authors have proved theorems of the following 
type: 

Let xo, ¥o be any real numbers. Then for certain functions f(x,y), there exist 
numbers x, y such that 


1.1 x =X, Y= yo (mod 1), 
and 
1.2 If(x, y)| < max [|f(4, 0)|, [f, 4], Lf, DI, FG, — DIL 


The first result of this type, but with |f(4, 4)|, |f(4, —4)| replaced by 
min |f(4, +4)|, was given by Barnes (3) for the case when the function is an 
indefinite binary quadratic form. A generalisation of this was proved by ele- 
mentary geometry by K. Rogers (6). Bambah (1) proved the theorem for 
binary cubic forms with three real linear factors, and Chalk (4) proved the 
same result for binary cubic forms with only one real linear factor. Mordell 
(5) generalised Chalk’s result and proved that for functions f(x, y) satisfying 
certain conditions, including the condition 


1.3 f(x, »)| < Rif (2x, 2y)|, 
for some k independent of x, y, one can find x, y to satisfy 1.1 and also 
1.4 If (x, y)| < &. max [|f(1, 0)], [f, 1)|, If, DI, If, —D)1- 


Any function satisfying 1.3 and 1.2 also satisfies 1.4, and in fact one can 
modify Mordell’s proof very slightly to get the theorem in the form 1.2 without 
imposing a condition 1.3. It is only when the function is not homogeneous that 
the results differ. 

Since Mordell’s and Rogers’ papers were elementary generalisations to cer- 
tain regions with one and two asymptotes respectively of the results of Chalk 
and Barnes, it might be interesting to see what properties of a region 
f(x, y) < 1 with three asymptotes through the origin are necessary in order 
that the result 1.2 may be proved. In this way, the essential property of the 
binary cubic required in Bambah’s theorem is revealed, namely that the 
region can be transformed by a linear transformation into one with hexagonal 
symmetry. 


2. Equivalent forms of the theorem, and some lemmas. Let /,0l,, /20ls, 
1;0l, be three lines through the origin O such that O/,, Ol; make angles of 60° 
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and 120° respectively with Ol,. Bambah (2) defines a star-region MN as having 
hexagonal symmetry if 


(i) N is symmetric with respect to the lines 1,Ol,, 1,0ls, 1,Ol, and their 
bisectors, 


(ii) the boundary % of N either terminates in the lines 1,O0l,, . . . , 1sOl¢ or 
has them as asymptotes, 


(iii) the region external to NM and lying between Ol, and Ol, is convex, 
(iv) each of the six branches of % is a continuous curve. 


The arithmetical form of the result to be proved is the following. 


THEOREM 1. Suppose the region f(x,y) < 1 has hexagonal symmetry, and let 
a, B, y, 5 be fixed real numbers with ab — By # 0. Then for any real tuo, vo there 
exist numbers (u,v) = (uo, 00) (mod 1), such that 


a y B 4 atB8,y+6 
flea + Bo, 10 + &») < max| (5,2), bt) 2°’ 2 )|. 


It is usually convenient to state the theorem geometrically. Let A be the 
lattice generated by the points A(a,7) and B(8,é), and let C= A+B. 
Let ® denote the region given by 


ys)<ma Ag 3) 48.8) A2$2.24)] 


Then ® is a region with hexagonal symmetry which contains the points 
+43A, +43B, +}3(A+B). We denote by N+P the region obtained from N by 
the translation which moves O to P. Then the above theorem is equivalent to 
the following. 


THEOREM 2. The parellelogram OACB, and hence the whole plane, is covered 
completely by the regions N+ P,P € A. 


Let @ be any automorph of 9%, and write QA = A’,QA = A’,OB = B’. 
Then ® contains +}$A’, +4$B’, +4$(A’+B’), and the plane is covered by 
MN + A if and only if it is covered by MN + A’. There is no loss of generality 
if we choose co-ordinates so that one asymptote is the x-axis. Then, since 
rotations through 60° or reflections in the axes are automorphisms of N, we 
can suppose by a suitable choice of 2 that one of the pairs 2A, 2B; QB, 2A; 
2A,2(—B); 2(—B), QA, which we can still call (A, B), satisfies 

(1) OA < OB, 

(2) the angle AOB is acute, 

(3) the rotation from OA to OB is anti-clockwise, 

(4) A lies in the region 0 < y < x3. 

Then we have only to prove the following: 
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Let A be a lattice generated by points A, B satisfying (1), (2), (3), (4). Let N 
be a region, say f(x,y) < k, with hexagonal symmetry which contains the points 
4A, 4B,4(A+B), and for which the x-axis is an asymptote. Then the regions 
MN + A cover the parallelogram OACB and hence the plane. 


For the proof we need three lemmas: 


(a) Let PQRS be a parallelogram with PQ and PS parallel to Ol, and Oly; 
respectively (i = 1,...,6; Ol; = Ol;). Define N’ by f(x,y) < f(a, 8), where 
a, 8 are the co-ordinates of the point 4(R — P). Then PQRS is covered by 
W+PiN +R. 

The region 9’ + P has as part of its boundary an arc which passes through 
the point 4(R — P) + P = 4(R + P) and has PQ, PS as asymptotes. For 
the region Jt’ + R, the corresponding point and asymptotes are —4(R — P) 
+ R = 3(P + R) and RS, RQ. The two arcs have a common tac-line at the 
point 4(P + R) and so do not cross in the parallelogram PQRS. Hence every 
point of the parallelogram is covered by one or other of the two regions, the 
small regions near Q, S in fact being covered twice. 

(b) Let PQR be an equilateral triangle with sides parallel to Ol,, Olz, Oly in 
some order. Let N’ be the region f(x, y) < f(4vV3, 4), where y is the distance 
of P from QR. Then PQR is covered by N’ + P. 

Since the point (}7+/3, }7) lies on the bisector of the angle /,Ol., a tac-line 
there to the boundary of J’ is parallel to Ol;. Hence the boundary of N’ + P 
has an arc which passes through the foot of the perpendicular from P to QR, 
touches QR at this point, and has PQ, PR as asymptotes; and so PQR is 
covered. 


(c) If 0 < 91 < x1V3, and x; < x2, then f(x, y1) < f(x 91). 


This follows immediately from the convexity and the relationship of the 
region 9 to the x-axis. 





(4,B)+P 





P r Q 


Figure 1 
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R/ 








Figure 2 


3. Proof of the theorem. The conditions (1) to (4) on A, B imply that A 
lies between Ol, and Ol:, and B lies between Ol, and Ol,,;, where i = 1, 2 or 3. 
When B lies between O/, and Ol;, denote the point of intersection of Ol, with 
the line through B parallel to Ol, by F and that of Ol, with the line through A 
parallel to Ol, by G. Taking Ol, to be the positive x-axis, we have the following 
cases to consider. 

(i) B lies between Ol, and Ol:. 

(ii) B lies between Ol, and Ol;, and F lies above G, in the sense that the 
ordinate of F is not less than the ordinate of G. 

(iii) B lies between Ol, and Ol;, and F lies below G. 

(iv) B lies between O/; and Ol,, and B is above A. 

(v) B lies between O/; and Ol,, and B is below A. 

Let the co-ordinates of A and B be (, g) and (r, s) respectively. 

(i) 








0 


Figure 3 


By (a), the parallelogram bounded by the lines through O and C parallel to 
Ol,, Olz is covered by N, N+ C, and hence OACB is covered. 








aay > 


le 
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(ii) 


c 





Figure 4 





gy 





Figure 5 


Since OA < OB, and since B lies in the second sector while A lies in the first, 
we have 


ya < OA sin 4x < OB sin 4x < yz, 


and so B lies above A. Draw lines ADG, AE parallel to Ol, and Ol;, and lines 
BFD and BE parallel to Ol, and Ol; respectively. Since F lies above G, it is 
clear that D is in the first sector. The treatment differs according as C is in the 
second or first sector, the corresponding figures being respectively (4) and (5), 
but we do not separate into cases until it is necessary. 

By (a), we see that ADBE is covered by NR +A, N+ B, OAG by MR, 
NM + A, and OFB by N, N + B. Because of the symmetry of OACB about its 
centre, it will now be enough to prove that the triangle DFG is covered by 
N,N+A,N+BLN+ C. 
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Since the distances of DF from O and FG from A are }(s + rx/3) and 
4(p/3 — q) respectively, it follows from (b) that DFG is covered by each of 
the sets a : . 

f(x, 9) < FAV B(s + rr), Us + V3), 
and ” : . 
U(, ») <flUAV3OV3 — ¢). 1OV3 - gD) +4. 
Now since B lies between Ol; and Ol;, we have 0 < |r|./3 < s, and we take 
in turn the cases when r is negative and when r is positive: 


(I) 0 < —rvV/3 < s: by reflection in Oy, then rotation through 47 in the 
clockwise direction, we deduce in turn that 


f(4r, 4s) = f(—4r, 4s) 
- {=i staf) 
a 4 4 


> f(AV3(s + 1/3), Us + V3), 
by (c), since for r < 0 we have s\/3 — r > sx/3 + 3r. Hence, in this case 
DFG is covered by the region f(x, y) < f(4r, $s) and so certainly by %. 
(Il) 0 < rvV/3 < s: by rotation through } in the clockwise direction, we 


see that 7 
fbr) = KES 2) 


4 
> SAV 3B(s — rv/3), Hs — rv), 


the inequality following from (c), since forr > Owe haver + s./3 > sV/3—3r. 
Thus, if we have s — rx/3 > pvV3 — 9, we can conclude that 


f(4r, 4s) > FAV3(O-V/3 — 9), HOV3 — @)), 

and so, by the remarks preceding (I), we see that DFG is covered by N + A. 

Now suppose that s — 71/3 < p./3 — g. This means that C is in the first 
sector, since the distance of A from the line through C parallel to Ol; is 
4(s — r/3), while the distance of A from Ol, is }(p»/3 — q). The figure is as 
in Figure 5 and there are two possibilities: 

Either, the triangle DFG lies completely between O/, and the line through C 
parallel to Ol; and is consequently covered by N, N + C; 

Or, the line through C parallel to Ol; cuts the lines DF, DG at points H, K, 
thus dividing DFG into the triangle DHK and the trapezium FGKH. Of these, 
FGKH is covered by R, N + C, and DKH is covered by 


U(x, ¥) < f{4NV/B(s — rV/8), Hs — r/3)}] + A, 


since }(s — rx/3) is the distance of HK from A. Hence, by the inequality 
above, we see that DKH is covered by % + A. This completes the investiga- 
tion of case (ii). 
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rs) 1 
Figure 6 
Cc 
B Z 
K A, 
2 
3 
- A 
0 Qa, 
Figure 7 


(iii) In this case, as F lies below G, D is in the second sector. Figures 6 and 7 
represent the situation when the line through C parallel to Ol, does not meet 
or does meet the triangle DFG. The case when C is in the first sector is not 
drawn but will be considered. As in (ii), we have only to show that DFG is 
covered by N, N+A,N+ BL,N+ C. 

Since the perpendicular distances from O to DG and from B to GF are q 
and $(s — rx/3) respectively, it is clear that DFG is covered by each of the sets 


f(x, 9) <f(4/3q, 49), 


and 
U(x, ¥) <f{d(s — 1/3) V3, Hs — rv/3)}] +B 


Now since A lies in the first sector, we have O < q < p+/3, but in fact we 
divide into cases according as q+/3 is not greater than or not less than . 
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(I) ¢v3 < ?. In this case, (c) implies that 
(4p, 49) > f(4-V3a, 44), 
and hence DFG is covered by %. 


(II) ¢vV/3 > P. By reflection in the line x = y+/3, the bisector of 1,01, we 
have 


f(4p, 4a) = fA + oV3), HOV — 9) 
V3(p\/3 — 9) (pr/3 — 9) 
> (Meson teva), 





using (c) at the second stage, since q\/3 > p implies that p+¢./3 > 3p—¢v3. 
Hence, using the second region above which we showed covered DFG, we see 
that DFG is covered by 9+ B if we make the further assumption that 
V3 — 9a >s — rx/3. This leaves the cases when £./3 — ¢ < s — rv/3. In 
this case B is nearer to the line through C parallel to Ol; than to OFG, so that 
certainly C is in the second sector. Figure 6 indicates the case when DFG 
lies entirely between O/, and the line through C parallel to Ol;; in this case, as 
before, DFG is covered by N, % + C. The case when the line through C 
parallel to Ol; meets DFG is shown in Figure 7, where it is seen that the triangle 
is divided into the trapezium FGKH and the triangle DHK. The trapezium is 
covered by 9, R + C, while DHK is covered by 


[i y) < {Vievs 9) nt = ‘) | 1B 


since the distance of B from HK is }(p»/3 — q). By the inequality proved under 
(II), we infer that DHK is covered by 


Lf(x, 9) < f(4P, 4¢)] + B, 
and hence, a fortiori, is covered by N + B. 


4, 





Cc 








Figure 8 
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(iv) By applications of (a), we have only to show that the triangle OFG is 


covered, as shown in Figure 8. Since the distance from B to OF is 4(s — r+/3), 
OFG is covered by 


[se y) < (ae = ry) += rv) | +B. 


Hence, if s — rx/3 < pvV3 — g, then OFG is covered by 
[see y) < ({Maevs — 9) e- ‘) | +B: 


and then, since the angle AOB is acute, A lies above the line y = x/+/3, hence 
qV3 > P, hence, as in (II) of (iii), 


{vievs — 9) ,o& - ‘) 











< f(4?, 4), 


and therefore OFG is covered by N + B. Finally, suppose that p./3 — ¢ 
< s — rx/3. Then C is in the second sector and, as before, we can show that 
OFG is covered by N, N + C, or by N, NR + C, and 


[ se, y) < (Viev3 — q) fa - ‘)] on 


Using the same inequality as above, we deduce the result. 
Case (v) is similar to (iv). In fact, since the relation OA < OB does not 
play any part in (iv), one can use the same proof after reflection in the y-axis. 
This completes the proof of Theorem 1. We note that the argument also 
shows that, if the regions are strictly convex, then strict inequality can be 
obtained in the statement of the theorem, except possibly when (wo, v) is 
congruent to one of (4, 0), (0, 3), (4, 4). 





4. Concluding remarks. The example 


S(x,y) = |xy(x — y)(x + y)| 


shows that the results for regions with one, two or three asymptotes do not 
have an analogue for general regions with four or more asymptotes. This is 
clear from the fact that for the above function the right-hand side of (1.2) 
is zero. 
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THE DISTRIBUTION OF TOTATIVES 
D. H. LEHMER 


1. Introduction. This paper is concerned with the numbers which are 
relatively prime to a given positive integer 


n= pi'pr’...p7 
where the #’s are the distinct prime factors of m. Since these numbers recur 


periodically with period nm, it suffices to study the ¢(m) numbers <n and 
relatively prime to n. Here 

(1) o(n) = n(1 — pi')(1 — pz")... (1 — 7’) 

is Euler’s function. Following Sylvester, these ¢(m) numbers are called the 
totatives of nm. One may ask how these totatives are distributed among the 
integers < n. Specifically we may divide the interval from 0 to m into k equal 
subintervals and consider the number of totatives in each of these subintervals. 
It is natural to suppose that these intervals are of more than unit length so that 
we shall suppose that » > & in what follows. The ambiguity of assigning an 
interval to a totative which occupies the common end point of two adjacent 
intervals does not arise. In fact if gn/k is a totative, m must divide k. But m > k. 


Hence we define, for each g = 0,1,...,%& — 1, the partial totient function 
$(k, g, m) as the number of totatives r for which 
(2) ng/k <7 < n(q + 1)/k. 


Alternatively, one may divide the unit circle into k equal arcs by the &th roots 
of unity and enquire about the number of primitive nth roots of unity in each 
such arc. It is clear that 


k—1 
(3) DL, o(k, gm) = o(1,0,m) = o(n). 
(= 
We shall be interested in the question of how uniformly the totatives are dis- 
tributed and so we introduce the function 
(4) E(k, q,n) = o(n) — ko(k, 9, %), 


which may be described as the excess of the number of all totatives of m over 
the number there would be if the totatives were everywhere as dense as they 
are in the interval 


(5) ng/k <x < n(q + 1)/k. 
The value of E(k, g, ”) is an integer, positive, negative, or zero. By (3) we have 


k-1 
(6) > Elk, g,) = 0. 
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2. Uniform distribution. ‘The vanishing of E(k, q¢,) is an indication of 
uniformly distributed totatives. For E(k, ¢g, m) to vanish even for one value 
of q it is necessary that ¢(m) be divisible by k. That this condition is not suffi- 
cient is seen from the example of m = 21 and k = 4. 


In this case ¢(m) = 12 is divisible by & but, as we shall see, 
E(4, 0,21) = E(4,3,21) = —4, 
E(4, 1,21) = E(4, 2,21) = 4, 
so that E(4, g, 21) never vanishes. 

If, for some k and n, the functions E(k,g,m) vanish for all values of q, 
then we say that the totatives of m are uniformly distributed with respect to k, 
there being $()/k totatives in each of the k intervals. For example, for every 
nm > 2 the totatives are uniformly distributed with respect to k = 2. That is 

E(2,0,) = E(2,1,”) = 0 (m > 2). 


This follows at once from the fact that if r is a totative, so also is m — r. 
Similarly we have 


THEOREM 1. [fn >k, E(k,g,n) = E(k, k — q — 1,m) for all values of q. 
THEOREM 2. If nm is divisible by k? then the totatives of n are uniformly distri- 
buted with respect to k, that is, 
E(k, q, hk?) = 0 (q=0,1,...,k—1). 
Proof. This follows at once if we consider the fact that all the totatives of 
n = hk® may be generated from those less than hk by adding successive mul- 
tiples of hk. In fact the integers 
t + ghk (q=0,1,...,8 —1;0 <7 < hk) 
are all totatives of m = hk? if r is, and every totative of is of this form. 


3. Auxiliary numerical functions. We proceed to develop formulas for 
E(k, q, m) in terms of simpler numerical functions. These functions are: 


u(n), Mobius’ function; 
(nm), Liouville’s function; 
6(m), the number of square-free divisors of n. 


All these functions, as well as ¢(m), are multiplicative, that is, if f is any one 
of these functions, then 


f(n) = I] s(¢7). 
For prime power arguments we have: 


w)=47} ifa = 1, 
HPI=\ 0 ifa>l 


A(p*) = (—1)*, 
0(p") = 2. 
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In particular, 
O(n)=2'= D> 1. 


5 \m, w(8)940 


We conclude this list of well-known facts by quoting the following formulas: 
(7) u(n/s) = 0,n > 1, 

\n 
(8) 2, G6) w(m/5) = A(m) O(n), 


where the sums, as indicated, range over all the divisors 6 of m. The second of 
these is due to Liouville (3). 


In what follows we denote by [x] the greatest integer < x. 


THEOREM 3. 


E(k,q,n") = ps ‘3 + 2] - | +19] u(n/é). 


bin 


Proof. By a theorem of Legendre (1, pp. 7-8) the number of totatives of n 
which do not exceed x is given by 


> [x/5] (8) = p> (ax /n] u(n/8). 


In particular, 


(9) o(n) = 2 5 u(n/8) 
and 
(10) o(k, q,n) = > {[8(q + 1)/k] — [6¢/k]} u(n/S). 


bin 
The theorem now follows from (4). 
For g = 0 we have the simple formula 


(11) E(k, 0,m) = 2) re(6) u(n/s), 
where r,(8) is the least positive remainder, 6 — k[4/k], on division of 6 by k. 


THEOREM 4. If n is divisible by a prime p of the form kx +- 1, then the totatives 
of n are uniformly distributed with respect to k, that is, 


E(k,q,n) = 0 eG os 0s k — 1). 
Proof. It suffices to show that in case 
n=p*m, p=kx+i1, pim, 
then $(k, g, ) is not a function of g. Now 
(12) o(k,g,m) = 2) u(m/8) g(k, g, 2°, 8), 


where 


coaron= BAe] [5 r 
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_ Since »(p*) = 0 for 8 > 1, we have 





g(k, 9, P*,8) = [eat | “as | | 2 | ee 7 0] ‘ [ . 
Let p* = kr + 1, p=! = ks + 1. Then: 
[]- a+ [3] 
[ee] - 0 +[¥]. | 
[ aia | = rig + 15 + [se+ 0] ; 
[eg]. ay + [OE 


Substituting, we find 
g(k, g, p*, 8) = (r — s)6 = 5 b(p*)/k. 


Since this is not a function of g, the theorem follows. 


Incidentally we may substitute the value obtained for g(k, ¢g, p*, 4) into 
(12) and get 


(8, am) =X w(m/s) 5 SL) — g(pr) 2) _ 9) 


as it should be. 

We define m as an “exceptional number with respect to k” in case n is divi- 
sible either by k? or by a prime of the form kx + 1. Theorems 2 and 4 together 
state that if m is an exceptional number with respect to k, then the totatives 
are uniformly distributed with respect to k. We may confine our attention 
in what follows to non-exceptional numbers. Every number is exceptional 
with respect to 2. Hence we consider k > 3. 

The cases k = 3, 4, 6, are sufficiently simple so that it is possible to give 
explicit formulas for E(k, g, 2). These we proceed to develop. Some of these 
were given by van der Corput and Kluyver (4). 


—~ 


4. The case k = 3. By (6) and Theorem 1 we see that 
E(3, 2,) = E(3,0,n), E(3,1,") = —2E(3, 0, 2). 
Hence it remains to find E(3, 0, 7). 


THEOREM 5. Let n be a non-exceptional number with respect to 3, then 


_ §—2A(n) 0(n), 3| n, 
E(3,0,n) = Bie 6(n), otherwise. 


Proof. By (11) we have 
(13) E(3, 0, ”) = p> r3(8) w(n/8). 
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Now let » = 3* m, where, since m is not exceptional and >3, m; is a non- 
empty product of primes of the form 3x — 1 anda = 0 or 1. If 4; be any 
divisor of m, then 


(5:1) = 73(6:) (mod 3) 


and since 3 — 2r3(6;) takes on the values +1 or —1 and is congruent to 
r3(6;) (mod 3), we have 


(43) =3-— 2r3(6;). 
Hence, by (13), 


—2E(3,0,n) = p> (—2r3(8)) w(m/5) = w(3*) D> (—2rs(8:)) w(m1/8;) 


&, Im, 


= u(3*) > (A(81) — 3) w(m1/d:) = u(3") > (51) w(m1/8:) 
= u(3*) A(m1) O(m1) = A(m) O(n), 
by (8). Now 
een ng if 3| m, 


6(m), otherwise. 


From this the theorem follows. 


It follows from the above that E(3,¢,m) vanishes only for exceptional 
numbers » and then vanishes for all g. 


5. The case k = 4. By use of (6) and Theorem 1 we find 
E(4, 1, n) = —E(4, 0, n), E(4, 2, n) = —E(4, 0, n), E(4, 3, n) = E(4, 0, n). 
Hence it suffices to consider E(4, 0, ). 


THEOREM 6. Let n > 4 be a non-exceptional number with respect to 4. Then 


—(m) O(n), if n is odd, 
E(4,0,2) = "a O(n), ifn = 2 (mod 4), 


0, otherwise. 


Proof. Let m = 2*m, where m; is a product of primes of the form 4x — 1. 
By (11) 


E(4,0,n) = p> r4(8) w(n/8). 


Since 74(8) vanishes when 4 is a multiple of 4 and is equal to 2 for other even 
5, we may write, in view of (7) 


E(4,0,n) = w(2") Du 74(8:) (1/5). 
Now, as in the proof of Theorem 5, 


2 — ra(61) = ACS). 
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Hence, by (7) and (8), 
E(4, 0, 2) 


— u(2") > (A(81) — 2) w(m1/81) 
— w(2") A(m1) O(n) 
(—1)*"*A(m) u(2*) 0(n) 2. 


Considering separately the cases a = 0,a = 1 and a > 1, we have the results 
stated in the theorem. 


6. The case k = 6. If we apply Theorem 1 we find 
(14) E(6,5,n”) = E(6,0,n), E(6,4,) = E(6,1,2), E(6,3,n2) = E(6, 2,2). 
Since 
(15) $(6, 0, m) + $(6, 1, ) = (3,0, 2), 
we have, by (4), 
E(6, 0, ») + E(6,1,”) = 2E(3, 0, 2). 
Hence by (6), (14), and (15), we have 
E(6, 2,”) = —2E(3, 0, 2), E(6,1,) = 2E(3,0,) — E(6, 0, n). 


Thus it remains only to find E(6, 0, 2). 
We have, as a special case of (11), 


(16) E(6,0,n) = p> r(5) u(n/8). 
|" 
Let us write m = 2* 3° m where m is prime to 6. We distinguish 5 cases. 


Case 1. a = 0,8 = 0. In this case the sum (16) extends over divisors 6 
which are prime to 6 so that 
re(5) = 4r3(6) — 3. 


Since n > 1, we have 


(17) E(6,0,) = 4) r3(8) u(m/d) = 4E(3, 0, 2). 
|n 
Case II. a = 1,8 = O. In this case we note that if hk is even 
(18) re(h) = 2r3($h). 
Hence 


E(6,0,n) = Pe {2rs(6) — r6(5)} w(m/(26)) 


= 2E(3, 0, $n) — E(6, 0, $n). 
Using case I we find 
(19) E(6,0,”) = —2E(3, 0, $n). 
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Case III. a > 2,8 = 0. In this case 2/5 contains the factor 4 when 3 is 
odd so that (16) has a contribution from only the even divisors 4. That is, 


E(6, 0, ) = Pe 14(2) u(m/(25)) = 2E(3, 0, $n) 
by (18). 


Case lV. 8 = 1. Here we note that 


(20) re(3h) = 3r2(h), 
and write 
E(6,0,m) = Jo {ra(88) — r0(8)} u(n/(88)) 


3E(2, 0, n) — E(6, 0, 4n) 
— E(6, 0, 4n). 


Thus case IV reduces to one of the preceding cases. 


Case V. 8 > 2. In this case »/é contains the factor 9 when 4 is not a multiple 
of 3, so that (16) becomes 


E(6,0,n) = Pe, 70(88) u(n/(38)) = 3E(2,0, in) = 0, 


in view of (20). Summing up the results of the 5 cases and applying Theorem 5 
we have 


THEOREM 7. Let n > 6 be a non-exceptional number with respect to 6. Write 
n = 2* 3° nm, where mn, is prime to 6. Then 
1 + 5u(2") 
1 — 7y(2") 


We see that the non-zero values of |E(6, g, ”)| are powers of 2 as in the cases 
k = 3,4. 





E(6, 0, ) = 2u*(3") A(m) O(n). 


7. Additional explicit formulas. Explicit formulas for E(k, q,m) in case 
¢(k) > 2 are in general lacking. We may remark, however, that 
E(12, 2,”) = 3E(4,0,) — 2E(6,0,") = E(12, 9, n) 
E(12,3,) = 4E(3,0,) — 3E(4,0,) = E(12, 8, 2), 
so that E(12, g, ) may be evaluated explicitly in the four cases g = 2, 3, 8, 9. 


The case where n is a product of distinct primes of the form kx — 1 is how- 
ever capable of treatment. In fact we have the following theorem. 


THEOREM 8. If n is the product of distinct primes of the form kx — | then: 


E(k,0,n) = E(k, k — 1,”) = $(2 — k) u(m) O(n) 
E(k, 1,") = E(k,2,n) =... = E(k, k — 2,n) = u(m) O(n). 
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Proof. Since every prime factor of is of the form kx — 1, every divisor 6 
of is of the form 
(21) 5 = mk + u(8), 


where m, is an integer > 0. 
If g = 0 then (9) gives 


E(k, 0,") = 5 u(n/s)\a - | ; } 
= 2 u(n/d){5 — kms + 4(u(6) — 1))} 
= 2, u(n/8){u() — tku(S)} 
= p(n) 6(n)(1 — 3). 
By Theorem 1, E(k, k — 1,”) = E(k, 0,n). If now 0 < ¢g < k — 1, we may 


show that 
[ 2 | bs [ e+ 8] 
k k 
is not a function of g as follows. By (21) 


| # | = [ om + wu) | = gm; + $(u(5) — 1), 


[stn] . Lom, ++ ava) = gms + mi + §(u(8) — 1). 
Hence 
[4™]-[¢]-- 
is not a function of g. Therefore by (9) 


E(k, 1,n) = E(k,2,") =... = E(k, k — 2,n). 


To find this common value we need only use the fact that the sum of all the E’s 
is zero. Thus 


(k — 2) E(k, 1, ") + 2E(k, 0,2) = 0, 
that is, (k — 2) E(k, 1, ) = 2(4k — 1) w(n) O(n) 
or E(k, 1,2) = p(n) 0(n). 
Thus the proof is complete. 


The explicit values obtained above for E(k, 0, ) show that for an infinity 
of k and n, |E| ¥ 0 and for these values 


E(k, 0, 2) # 0(6(n)), 


as 6(n) — ~. This contradicts a conjecture of Erdés. Vijayaraghavan (5) 
showed the invalidity of the conjecture by a different argument in 1951. 
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8. General estimates. On the other hand we give some general results which 
show that totatives are, after all, fairly evenly distributed. Thus the following 
theorem shows that any two ¢’s do not differ by as much as @(m) = O(n*) for 
any k. 


THEOREM 9. |¢(k, qi, 2) — $(R, 2, m)| < O(n). 
Proof. Denote by {q:, ¢2} the expression 
L(g: + 1) 6/k] — [q1 8/k] — [6/k] — [(g2 + 1) 6/k) + [92 5/k] + [6/R). 
For any real x, y the function 
[x + y] — [x] — by] 


takes on the values 0 or 1. Hence {q:, g2} takes on only 0, 1 or —1. 
But by (10), 


(k, gr, m) — (k, go, m) = pa u(n/5) {qr, go}, 


where the dash indicates that the summation extends over those divisors 6 of 
n for which u(n/5) # 0. Hence 


Jo(k, gum) — O(B, ge, m)| < Ze |u(ne/8)|| {G1 92}| < Qo 1 = O(n). 
As a consequence of Theorem 9 we have 


THEOREM 10. For every q, |E(k,q,")| < (k — 1) O(n). 


Proof. 
\E(k, g, n)| = |o(m) — ko(k, g, ")| = | D> (o(k, a1, ) — O(k, g, 2)} 
k-1 a 
< XL lolk, qi, n) — o(k, g,n)| < (k — 1) O(n). 


As a corollary we have 


o(n) — (k — 1) &(n) 








. < (bk, 9,2) < o(n) + ea 1) O(n) 
uniformly in g. For g = 0 we have the stronger statement, 
(22) $0) _ 4 0(n) < o(b,0,n) < 2) + gon). 
In fact, by (10), 
ok, 0,n) = 5 ninss)] | 
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Now 
IS] << Qo u(n/d) = 4 0(n). 
a(n/dmt 
From this (22) follows at once. 


9. Applications. We conclude with a few remarks about an application 
of the foregoing results. 

Let Q,(x) denote the irreducible polynomial whose roots are the ¢(m) primi- 
tive mth roots of unity. That is 


Q.(x) = [] (x — exp(2rir/n)) 


tT 


or 


(23) Q, (x) = I] (x* — 2x cos(24r/n) + 1). 


We suppose that m > 6 and x > 0 and ask for inequalities for Q,(x). Since the 
factors of (23) are monotone increasing functions of r, inequalities are easily 
obtained by subdividing the range 0 < +r < 4m into (for example) the four 
intervals 
O<r< gn, in <r < hn, in <r < 4n, in <r < jn 

and counting the number of totatives in these intervals. These numbers are 
respectively : 

A = $(6, 0,2), 

B = $(4,0,) — $(6, 0, 2), 

Cc ¢(3, 0, n) ons ¢(4, 0, 2), 

D = $(6, 2,n) = 3¢(n) — $(3, 0, 2). 

Thus we obtain the following inequalities: 

(24) Qn(x) > (% — 1) (x? — x + 1)9(x* + 1)°(x*? + x + 1), 
(25) Qn(x) < (x? — x + 1)4(x? + 1)9(x? + x + 1)°(x + 1)”. 
Estimates for A, B, C, D, may be obtained from (22) and give 

so(n) — 30(n) <A < 5G(m) + 40(n), 

¢(m) — O(n) < B<isd(m) + O(n), 

T2¢(m) — O(n) < C<rd(m) + 4(n), 

io(n) — $0(n) <D < id(m) + 40(n). 


Sharper inequalities, especially for certain types of ~, can be obtained from 
(24) and (25) by applying Theorems 2, 4, 5, 6, and 7. Similar results may be 
written down for x < 0. Such results are useful in discussing the existence of 
“characteristic prime factors’’ of a* — 6", Lucas’s functions and their generali- 
zations (2). Of course any such inequalities will not give the asymptotically 
correct result: 


Q,(x) = x°™ (1 — w(n)x-' + O(x-*)) (x — @), 


Their utility lies in the direction of actual inequalities for a fixed value of x. 
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SYSTEMS OF LINEAR CONGRUENCES 
A. T. BUTSON anv B. M. STEWART 


1. Introduction. On recent occasions papers have been presented concerned 
with the problem of solving a system of linear congruences. Apparently the 
authors were not aware that this problem was solved very neatly and com- 
pletely a long time ago by H. J. S. Smith (5; 6). One reason for this situation 
is that recent texts in the theory of numbers go only as far in the discussion 
of systems of congruences as one can with the most elementary tools; whereas 
older texts, such as the one by Stieltjes (8, pp. 284-377), devote so much space 
to the discussion of the requisite matric theory that the reader is liable to lose 
sight of the elegant results concerning systems of congruences. Perhaps the 
time has come to give a new exposition of this material, particularly since this 
can be done in rather short compass to an audience whose background may be 
assumed to include acquaintance with the invariant factors and the Smith 
normal form of a matrix with elements in a principal ideal ring, . 

In the final part of this paper we present some original work extending the 
discussion of systems of linear equations, and systems of linear congruences 
modulo an ideal, from the classical case over the rational domain to the case 
where the systems are over a set of integral elements, with a B-basis, belonging 
to an associative algebra. Here we assume knowledge of the Hermite normal 
form of a matrix with elements in a principal ideal ring. 


THe CLASSICAL CASE 


2. The problems. The coefficients, constants, and moduli in the following 
equations and congruences are assumed to be in a specified principal ideal ring 
$, such as the rational domain. For the system of linear equations in m 
unknowns represented by 


De x10 = ky, coe § ae 
we will use the matric notation 
(1) XA = K, 
where X is 1-by-n, A is n-by-p, and K is 1-by-p. 
The first problem is to determine when (1) has a solution X with elements 


in ¥, to find how many solutions there may be, and to give a method for 
actually obtaining the solution. 


For the system of linear congruences represented by 


> x, by, = g, (mod m,), J @ 1 2.cceh 
i=1 
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we note that if m = [m,, m2, ..., m,] is the least common multiple of the m,, 
then an equivalent system of congruences is given by 


2 1 asy = ky (mod m), r,s eer," 


where a;, = byr;, ky = gy}, and m = my;,. If we denote this system by 
(2) XA = K(mod m), 


then the second problem is to answer for (2) the same three questions we have 
listed for (1). 


3. Necessary and sufficient conditions for solving (1). There exist (4, 
Theorem 105.2) unimodular matrices U and V with elements in $, U being 
n-by-n and V being p-by-p, such that UA V = E is in Smith normal form, with 
zero elements everywhere except in the main diagonal where there may appear 
non-zero elements ¢), €2,..., €, (which are called invariant factors and which 
are uniquely determined up to associates in $) having the property that 
é, divides e,,; and either r < p C nmorr <n < p. 

Hence the system (1) may be replaced by the equivalent system 


(XU-)(UAV) = KV, 
so that by setting Y = XU-' and C = KV, we arrive at 
(3) YE = C. 
The system (3) is so simple that we can immediately conclude that necessary 
and sufficient conditions for its solution are as follows: 
(4) e, must divide c, += 1,2,...,7; c¢ =0,%>-F. 


If we define A’ = (%) as the augmented matrix of (1), then using the con- 
ventional block notation we have 


-_ , E 
(o {)4v- (0): 
so that a further transformation by unimodular matrices U’ and V’, where U’ 
is (w + 1)-by-(m + 1) and V’ is p-by-?, will take A’ into its Smith normal form, 
say U’(%)V’ = E’, which is ( + 1)-by-p with elements e’, in the main dia- 
gonal. Thus depending on the relative size of m and p, the conditions (4) may be 
given the following form: 


(5) pen: e&yme, += 1,2,...,p; 
(5) n<p: ey =e, i= 1,2,...,; and e,4; = 0. 


4. Necessary and sufficient conditions for solving (2). Since the congruence 
problem (2) requires the existence of elements ¢, in 8 such that 


tam + Qo xia = ky oo eee 
t=—_ 
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it is easy to replace the system of congruences (2) by an equivalent system of 
pi = p equations in m, = nm + p unknowns, say 

(6) XA +TM = K, 

where T is 1-by-p and M = mI, is a scalar p-by-p matrix. 

Since p; < m;, we apply the test (5) to the system (6). This requires us to 
compute the invariant factors of (%) and (4%). Fortunately this task is easy 
because of the form of M. Following an argument by Butson which is more 
direct than that used by Smith, we write 


(0 9-)(ar) ¥= Ga): 


Because e,; divides ¢,,:, we see that no further arrangement of columns is 
necessary and that the ith invariant factor of (%) is either (e,,m) when 
i <p <2, or is m when nm <i < p. 

Similarly, for (%,) the ith invariant factor is (e’;,m) when i < p < n; but 
when <p, the (m+ 1)st invariant facto- is (é,4:,m), and when 
n+1<i< p, the tth invariant factor is m. 

Hence the test (5) shows that the necessary and sufficient conditions for the 
solution of (2) are as follows: 


(7) pen: (ey,m) = (em), += 1,2,...,); 
(7!) n<p: (em) = (em), i= 1,2,...,; and (e941, m) = m. 


We note that the final condition in (7’) may be written e’,,; = 0(mod m). 


5. The number of solutions and their form. To determine how many 
solutions there are and actually to produce them, we return to (3), supposing 
that the necessary and sufficient conditions stated above are satisfied. 

In the case of equations we see from YE = C, that the first r of the y’s are 
determined uniquely by y,; = c,/e;, while the remaining m — r of the y's are 
arbitrary. The complete solution of (1) is then given by X = YU and involves 
m — r parameters. Of course, since U and V are not unique, the complete 
solution may be obtained in a variety of forms, differently expressed, but 
actually equivalent. 

In the case of congruences we consider solutions X’ and X of (2) to be dis- 
tinct only when X’ # X (mod m), i.e., when for at least one value of i we 
have x’, # x; (mod m). We see from YE = C (mod m), that the first r of the 
y's are determined by congruences of the form y, e; = c; (mod m). From 
properties of $ we know there are as many solutions y, which are incongruent 
mod m as there are residue classes of $, mod (e,, m). The remaining y’s are 
arbitrary, so for each of these there are as many solutions incongruent mod m 
as there are residue classes of $ (mod m). The solutions of (2) are given ex- 
plicitly by X = YU (mod m), so there are as many distinct solutions X as 
there are distinct solutions Y. (Moreover, we may check that x’, = x, (mod m) 


if and only if x’, = x, (mod m,), j=1,2,...,93 
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so there are the same number of incongruent solutions of the original system 
of congruences with moduli m, ms, . . . , my.) 

In particular, when § is the domain of rational integers, the above considera- 
tions show that there are exactly 


N = (e;, m)(e2, m) ... (e,, m) m™-" 


distinct solutions of (2). 


6. Example. We take § to be the rational integers and consider the system 


3x) oe Xe = 5, 
5x, + 3x2 = |], 


Using the notation of the preceding sections, we have 


- 0 1\/3 5\/l 3 l 0 
j= = = 
aa (; ~ \G 3)(0 o ({ ‘) a 


KV=(5 1)V=(6 4) =C, 


E 1 0 0\/1 0 1 0 
u'(é) - -5 -3 1,0 4}=[0 2)=E'. 
10 7 —-2/\5 14 0 0 

Since 4 = e; ¥ e’; = 2, it follows from (5) that the system has no solution 
in rational integers. 

Considering the same system mod m, we see from (7) that we must have 
(1, m) = (1, m) and (4, m) = (2, m). If m = 0 (mod 4), there are no solutions; 
if m = 1 or 3 (mod 4), there will be N = 1 solutions; and if m = 2 (mod 4), 
there will be N = 2 solutions. 

Thus if m = 10, we solve y; = 5, 4y2 = 14, for y; = 5, yo = 1; and y; = 5, 
¥2 = 6. Then from X = YU, we compute x; = 1, x2 = 2; and x; = 6, x. = 7, 
respectively. 


THE CASE OF INTEGRAL ELEMENTS OF AN ALGEBRA 


7. Sets of integral elements. Let %&f be an associative algebra defined over a 
field § and possessing a modulus ¢. Suppose that § contains the principal 
ideal ring $. Each element of a set © of elements of & will be called an integral 
element if the set has the following three properties: 

U (unity): the set contains the modulus e; 

C (closure): the set is closed under addition, subtraction, and multiplication; 


B (finite basis): the set contains elements €; = €, €2, . . . , & such that every 
element of the set is expressed uniquely in the form 
> a; & 


where each a, is in . 
As an example we may take k = 1 and obtain as © the ring § itself. Again 
when § is the rational field and § the rational domain, we see that © is a set 
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of integral elements (but not necessarily a maximal set) in the sense of Dickson 
(1, Chapter X). See also (3). 


8. The regular representations. If a = > a, « is any element of O, by 
properties C and B there must exist elements 7,, and s,, of $ such that 


ae, = (Da, 6) 6 = Dry €s j * a 
€j;a €(D a: &:) = J Sy €e, io: Seer | 
Hence with each a in © there are associated k-by-k matrices R(a) = (r,,) and 
S(a) = (s,,) with elements in §. 
If E indicates the 1-by-k matrix with elements ¢;, €2, . . . , &, then in matric 
notation we have 


(8) aE=ER(a), Ea = S(a)E’. 


Since ¢; = ¢, the first column of R(a) and the first row of S(a) consist of 
precisely the elements a, a2, . . . , @;. Then from property B it follows that the 
correspondences defined by (8) are both one-to-one. Moreover, it is easily 
shown that the correspondences are preserved under the addition and multi- 
plication operations of each system. Hence the matrices R(a) and the matrices 
S(a) provide isomorphic representations for Q, well known, respectively, as 
the first and second regular representations. 

If a and 8 are in OQ, we may use (8) and the fact that elements of 8 commute 
with elements of © to write 


R*(a)S(8)E7 = R7(a)E* 8 = alE™ B 
= alS(8)E* = S(8)aIE* = S(8)R*™(a)E’; 


then from property B, it follows that 
R™(a)S(8) = S(8)R7(a). 


In particular, letting A = (a;,a2,...,a,) and B = (b;, be,..., 5) be the 
first rows of R*™(a) and S(8), respectively, we obtain the useful relation 
(9) AS(8) = BR™(a). 


9. Systems of linear equations over OQ. We consider the following system of 
p linear equations in m unknowns: 


(10) De as x4 Bey = Ys j=1,2,...,2, 


where the a;,, 8,;, and yy; are given elements of Q. Since O is not necessarily 
commutative, note that coefficients are allowed on both sides of the unknowns. 
We are concerned to establish necessary and sufficient conditions that (10) 
have solutions x1, x2, .--, Xn Which are in Q. 

If we assume that such solutions exist we may write x; = >> x, €;, where 
the x,, are in $, and define X; = (xu,..., 2X). Supposing y, = } cy €;, we 
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define C, = (cy,..., Cy). We define A,, to be the first row of S(a,,). Then 
(8), (9) and (10) imply that 


C; E* - ) Ay E* Xt By = > Ai S(x:) E” Buy 
- 4 XxX; R™ (as) S(8,;) E’. 


Hence property B implies that 


Cy= 2 XR" (aus) S(B.), a. ee 
t= 
We set 
C= (Cu, ++» Cigy Coty - ++ 5 Cony Coty +s» Cor)» 
xX = Beh 2 44 eas Gites + op ee os St, «2 oe Mas 


and A = (R*(a,;) S(8:;)) where C is 1-by-pk, X is 1-by-nk, and the “enlarged 
coefficient matrix” A is nk-by-pk made up of k-by-k blocks of which the one 
in the ij-position is R* (a;,;) S(8,;). Then the equations obtained above may be 
written as the single matric equation 


(11) XA = C. 


Except for the size of the matrices involved, (11) is precisely a system of the 
classical type (1) with kp equations in kn unknowns, with the elements in- 
volved all in $. 

Conversely, if (11) has a solution X in B, we can retrace the steps above to 
obtain in © a solution of (10). Moreover, by property B, distinct solutions of 
(11) lead to distinct solutions of (10). 

Thus the problem of solving (10) in © has been shown equivalent to solving 
(11) in $B. Referring to (5) and (5’) we can assert that if e:, e2,... are the 
invariant factors of A and if e’;, e’2,... are the invariant factors of the aug- 
mented matrix (4), then necessary and sufficient conditions that the system 
(10) have a solution are that 


(12) pean: og ze, +2 1,2,..., kp; 
(12) n<p: ey me, +2 1,2,..., km; and epey:’ = 0. 


Determining the number of solutions and the most general solution proceeds 
along the lines given in §5. In these matters it is worth a word of caution that 
the rank of A need not be a multiple of &. 

We note, thanks to the referee, that one type of matric equation, well known 
in the literature, is included in the above discussion. For if the algebra W is a 
total matric algebra of order k = n’, having the natural basis of elements €,,, 
where ¢€,,; is an m-by-m matrix with 1 in the ij position and zeros elsewhere, so 
that the multiplication table is 


€ij €Ers = 55, €isy 


Ome = (eis, « « «5 Cand Gttp << + Ghat ++ 3 Gite soe ep Qed Chem the typical 
element 8 = > di; €:;, which we ordinarily represent as B = (5,,), has the 
regular representations 


R(g) = 1I- XB, S(s) =B- XI, 
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where M- X B indicates the direct product matrix which is n*-by-m? and whose 
1j block is Mb,,. Then a linear equation like a x 8 = y is replaced, according to 
the theory above for passing from (10) to (11), by an equation X’ D’ = C’, 
where 

D’ = R™(a) S(8) = (I- KX A)™(B- X I) = B- X A™ 


and X’ and C’ are 1-by-n?, obtained from X = (x,,) and C = (c,,), respectively, 
by taking row blocks. The general linear equation in one unknown 
> a: x 8; = y may be treated in the same manner, the enlarged coefficient 
matrix being D” = >> B,- X A,’. This is the nivellateur studied by Sylvester 
(9), however, only for the case 8 = §. 

Similarly, the system of equations (10) may be generalized to allow each 
unknown to appear in a finite number of summands in each equation; the 
technique for passing to (11) remains the same, except each component block 
of the enlarged coefficient matrix will now be a sum of matrices of the type 


R® (ai) S(8;;). 


10. Minimal bases for ideals in 2. In the usual manner the left ideal M 


generated by £1, f2,..., 1, a given set of elements of QO, is defined to be the 
set of elements 
t 


dD vhs 


i=1 


obtained by allowing the left-multipliers », to vary independently over all of 
©). A minimal basis for the ideal M is by definition a set of elements yw, wo, ... , 
u, such that an element of © is in the ideal I if and only if it can be repre- 
sented in the form 


s 
> cms 


i=1 


where the c, are in $; and this representation is to be unique. 
An argument by MacDuffee (2) shows that if H is the uniquely determined 
left-Hermite form of the matrix 
S($1) 
Bas ice. 
S(t.) 
then the non-zero rows H, of H determine a minimal basis for IM, having 
s < k, by the relation u, = H,E*. The notation which we have been using 
makes it simple to reproduce the proof. 
Let U be a unimodular matrix, ki-by-kt, having elements in $, such that 
US = H. Let V = U-", so that S = VH. If the ith row of U is divided into 
1-by-k blocks U,,, then 


ii = H,E* - ) Ui; S(F5) E’ = >» Uy E* [; = LD vs 4 


where »,;, = U,,; E” is in Q, hence y; is in the ideal M, and so are all }- c; uy. 
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Conversely, given any element » = >> », {, in the ideal, we have », = > my, ¢, 
and if we define N,; = (mq, ..., 2a) we can write », = N,E*. Hence 


y= > N,ETS, = > N,S(g,) ET = NSE* = NVHE? = ¥ oyu, 


where N is 1-by-kt, made up of the 1-by-k blocks N,, and where c, is the 
element in the ith column of NV. Since ¢; is in 8, a representation of the desired 
type for vy has been found. The uniqueness of the representation follows from 
the independence of the non-zero rows in the canonical left-Hermite form H. 

In an analogous way we define the right-ideal generated by f1, fs, ..., f; 
to be the set of elements >> £; 9; obtained by allowing the right-multipliers n, 
to vary independently over Q. In this case a minimal basis can be found by 
computing the left-Hermite form: D of the matrix R which is kt-by-k with its 
ith k-by-k block being R7({,); for if D,,...,D, are the non-zero rows of D, 
necessarily with r < k, then the elements 5, = D, E* serve as a minimal basis. 

By combining these observations we can find a minimal basis for the two- 
sided ideal generated by £1, f2. ... , {, whose typical element is 


@ 
) Vij Css 
j=l 


where the g, are all finite. For we may first compute a minimal basis 
M1, ¥2,-.- Ms for the left ideal generated by £1, f2,...,£, and replace each 
Vij o% by , i Cijm Um- Then 


. ¢ 4 
a= 2 bm "ms where nm = > De Com Niz 
m= ae 


t 
a= 
=l 


Hence if, secondly, we compute a minimal basis 4), 52, . . . , 5, for the right ideal 
generated by 1, ue, ..., “,, we shall have arrived at a suitable minimal basis 
5, d2,..., 6, for the two-sided ideal generated by {1, f2,..., ¢¢. 


However, not every matrix H in left-Hermite form represents a minimal 
basis for an ideal of © (2, p. 76). 

When a and £ are in Q, by the notation a = 8 (mod Jt) we mean that 
a — B is in the ideal I2 and we say that a and @ are in the same residue class 
mod 9. For the sequel it is important to notice that, in general, it is only 
when the ideal M is two-sided that multiplication of residue classes mod JM is 
well defined. 


11. Systems of linear congruences modulo ideals. Over © we consider the 
following system of p linear congruences, modulo ideals of Q, in m unknowns: 


(13) De as x1 Biy = 5 (mod M,), j= » erry 


We shall assume as explained in §10 that for the ideal I2,, whether it be left, 
right, or two-sided, a minimal basis of s, elements has been found, say 


Mijs Boj, +++ 9 Masgs 


given by wy = H,,E* where the H,, are non-zero rows of a left-Hermite 
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k-by-k matrix. We let H, be the s;by-k matrix with rows H,,, so that H, is 
what is called an echelon row form. 


The system (13) is then equivalent to the system 


De aes x1 By + De tes Hes = Ys 7 fo Sef 
i= = 


where the unknowns ¢,; are in $. Following the same development and using 
the same notation as in §9, we may show that solving (13) in © is equivalent 
to solving in $ the following system: 


(14) (X 7)(4) = C, 


where 
T= (ti, « ee » e,23 S22, « es » be,33 « ee + bag, « ee » bag) 


and H is the direct sum H = H, + H,+...+H,. 

The number of unknowns in (14) is mk + s, wheres = > s,, and the number 
of equations is pk. 

If pk < nk + s, we apply (5) to obtain the conditions 


4 A 
(15) eA C -«(4), {= 1,2,..., 9% 
(15’) «(¢) - (4), Ew ccentee ee & 
‘ = 


0 ,t=nk+s4+1. 

Thus (15) and (15’) represent necessary and sufficient conditions for the 
solution of (13). 

If the solution is obtained as in §5, starting from (14), unnecessary para- 
meters may be noticed. We have made a further study of (14) in which the 
Smith forms D, of H, play a part, as well as the least common multiple m of the 
elements of all the D,. This method seems of some interest because of avoiding 
unnecessary parameters, but the alternative set of conditions which is obtained 
lacks the directness of (15) and (15’). This further study emphasized the need 
of care in the definitions of congruent solutions. 

Suppose that x1, x2,..-, Xx is a solution of (13). If all the ideals M, are 
two-sided and if 


(16) x's = xi (mod M,), 6= 1,3,...,0f @ 1,3... 0% 


then x’:, x’2,.--, x‘, is also a solution of (16). But if one or more of the ideals 
M, is one-sided, (16) is no longer sufficient to guarantee that x's, x’2,..., x's 
is a solution of (13). Having given these words of caution, we now define sets 
of solutions of (13) which satisfy (16) to be congruent sets. 
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In matric form (16) may be written 
X', —~ X, = Wy Hy, foe ae te eh Se 


Hence X’, — X, is a common left multiple of the H,. By repeated application 
of the method described in (7), there is a constructive way of finding M the 


least common left multiple of H,, H2,...,H,. Then (16) is equivalent to the 
existence of Q, with elements in $ such that 

X’, —~ X, = Q,M, §= 3,3...58 
and hence to the single matric equation 
(17) X’' — X = QM* 


where M* = M+ M+...+-M with m summands. Hence we may apply 
(5) and (5’) to obtain conditions equivalent to (16) expressed in terms of the 
invariant factors of M* and 
M* 
(. -Xj°* 


12. Example. Letting § be the rational field and $ the rational domain, we 
consider the algebra & having as a basis €; = €, €2, €3 With €2 €2 = €2, €3 €2 = €3, 
and €2:¢€; = €;¢; = 0. If we take as © the set of all a= aea+be6.+ Ce; 
where a, b, c are in $, we have a set of integral elements with the basis ¢€;, €2, €3. 


Using (8) we find 


a b c a b c 
R (a) =|0 a+b c}, S(a) =|0 a+b 0 ‘ 
0 0 a 0 0 a+b 


Taking a = (3,3, 1) E’, 8 = (1,5,2) E7, y = (0,0, 2) E7,¢ = (6, 2,12) E’, 
we will study 


(18) axB=y7, 
(19) ax 6B = y (mod (f)), 
(20) ax B = y (mod [f)), 


where ({] and [f) indicate, respectively, the left and right ideals generated by ¢. 
First we compute 


3 3 1\/1 5 2 3 33 12 30 0 
A = R7(a)S(8) =|0 6 140 6 O}=[0 36 6), E=(0 6 O}. 
0 0 3/\o 0 6 0 0 18 0 0 108 


When we find that A’ = (@) has e’; = 1, e’: = 6, e’s = 108, it follows from 
(12) that there is no solution to (18). 
Since S(¢) has the left-Hermite form 


244 40 0 
L={12 4 O}f, 
6 2 4 
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we find that (2) has e, = 1, e. = 2, es; = 12, and that (7) has e’; 
e’; = 12, so by (15) there is a solution of (19). 
Since R*™(¢) has the left-Hermite form 


- e's = 2. 


244 #0 O 
R=| 6 2 Of}, 
0 0 6 


we find that (4) has e; = 1, e. = 6, e; = 12, but that (4) has e’; = 1, e’: = 2, 
e’; = 12, so by (15) there is no solution to (20). 

When we carry through the actual solution of (19) we find that the most 
general solution involves three parameters: 

x= — 242, + 223, x2 = 222; om 223, x3; = l + 782; + 42. _ 1223. 
When we apply (17) we find that pairs of the solutions are congruent mod (f], 
if and only if 
2’; =2,(mod 4), 2’, = 22(mod2), 2; = 2z;(mod 3). 
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SOME SERIES OF PARTIALLY BALANCED 
INCOMPLETE BLOCK DESIGNS 


D. A. SPROTT 


1. Introduction. The use of incomplete block designs for estimating and 
judging the significance of the difference of treatment effects is now a standard 
statistical technique. A special kind of incomplete block design is the Partially 
Balanced Incomplete Block Design (PBIBD) introduced in (3). A PBIBD is 
an incomplete block design that obeys the following conditions: 

(1) there are } blocks of k distinct varieties each; 

(2) there are v varieties, each replicated r times; 

(3) given any variety, the remaining ones fall into sets of m, varieties each 
(i = 1,2,...,m) such that every variety of the ith set occurs A, times with 
the given variety, \, being independent of the given initial variety; 

(4) given any two varieties which are ith associates (that is, occur together 


A, times), the number of varieties which are jth associates of the one and kth 
associates of the other is 


Pint = Px} 
and is independent of the original pair of varieties. 


From the definition of a PBIBD, certain fundamental identities concerning 
the parameters of the design follow; for ease of reference, these are listed as 


(1.1) bk = rv, 

(1.2) v-l= > m6 
(1.3) r(k—1) = Dim 
(1.4) n,—1= > Pa, 
(1.5) ny = > Pa’, 
(1.6) niPa’ = ny; Pa’. 


In (3), a module theorem was proved by which it is possible in certain cases 
to construct the entire PBIBD by adding elements of a module to a given 
initial block. The purpose of the present paper is to generalize the module 
theorem of (3) to the case v ~ b, and then to apply the methods of (6) to 
obtain some general series of PBIBD’s. 
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2. The general module theorem. 


THEOREM 2.1. Let M be an additive abelian group (module) consisting of n 
elements x, and suppose that there exist m blocks B,, Bs, ..., By such that: 


(1) every block contains k distinct elements; 
(2) among the mk(k — 1) differences arising from the m blocks, just n, of the 
non-zero elements of M are repeated i, times; we may denote these n, elements by 


t | ft. 
Oy» Aa, +++ 5 An; ’ 


(3) among the n,(n, — 1) differences 
ay! — ay (u,w=1,2,..., 234 ¥ w) 
every element of the set 


e e 
Gy » M2 5+ ++ 5 An, 


occurs p,,;° times, while among the n,n, differences 


ai — a (wu = 1,2,...,%50 = 1,2,..., 25), 
every element of the set 
ay", a2", ae | n,° 
occurs p,;° times. 
Then we can form blocks 
Bus (g = 1,2,...,m;0€ M) 


of elements x’, where x’ = x + 6 (x ranging over the elements of B,). Since there 
are m initial blocks and @ assumes n values, we thus get mn blocks By» (By. = B,), 
and these blocks form a PBIBD with parameters 


v=n,b = mn,r = mk, k; 1,4; Pi = (Pp') (¢ = 1,2,...,m). 


Proof. The proof of Theorem 2.1 is omitted, since it is an exact parallel 
of the proof given in (3) for the special case in which m = 1, v = b. 

In the series which we shall derive from Theorem 2.1, it will be convenient 
to reserve the letter p to denote a prime; M will always be a Galois Field 
GF (p*), and x will always denote a primitive element of M. 


3. Series with 4m(4\+1) + 1 = p*. 
THEOREM 3.1. Ifv = 4m(4X + 1) + 1 = p‘, and if among the 2d expressions 


4m 


x? —1 = x" (s = 1,2,..., 2A) 
there are \ + a even and  — a odd powers of x, then the design with parameters 
v= 4m(4.+1)+1, '= mv, r=m(40.4+ 1), k=4A4+1; 
my =m, = 2m(44+1), AT=A+a, Ac=A—<G; 


_ (4@-—5) 3@-1 aGe-% 4(v — 1) 
P= (i073 t@—1))' **™ \a@—1) i-- 8) 
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can be constructed from the initial blocks 


(x, ett — x2t+16hm) 


where i ranges from 0 tom — 1. 


Proof. As in (6, Theorem 5.1), we find, setting d = 44 — s + 1, that 


x" -_ gett t2mte— s) ‘ 


Divide the elements of GF (v) into two sets; set 1 will be composed of elements 
of the form x*‘, and set 2 will be composed of elements of the form x*‘+!. The 


differences 
x* latinas” x? 1+ 2m(4A—28+ StOrtG0 


for s fixed, range over set 1 or set 2 according as g, is even or odd. Hence, as 
$ ranges, each element of set 1 occurs A; = A + @ times and each element of 
set 2 occurs \2 = A — a times. Also, the number of elements in set 1 = the 
number of elements in set 2 = $(v — 1) = 2m(4\ + 1). Thus condition 2 of 
Theorem 2.1 is satisfied and m; = mz. = 2m(4d + 1). 

The differences between elements of set 1 have the form 


ttm yt (ye _ 1) (¢ = 0,1,...,3(0 — 3); 4 = 1,2,..., }(0—8)). 


As t ranges, these differences cover set 1 or set 2 according as x™ — 1 is an even 
or an odd power of x. Hence, if y of the expressions x™ — 1 are odd powers of x, 
we have 


Pu? = y¥, pu’ = $(v — 3) — ». 
The differences between elements of set 2 have the form 
tte an x2ttl = e2t+l (4 —_ 1). 


These cover set 1 or set 2 according as x™ — 1 is an odd or even power of x. 
Hence 


Po! = y, po® = $(v — 3) — 9, 
and condition 3 of Theorem 2.1 is satisfied. 
Using the identities 1.4 and 1.5, we obtain 


Pi’ + pi! = my — 1 = ¥(0 — 3), pis! 


y, 
and 
pa! + po’ =m = 3(0—1), pu' = $(2— 1) - ». 
Thus y = 3(v — 1) = pis’ = pa’; therefore P; and P, are as stated in the 
theorem. 


If c = 0, the resulting series is the completely balanced series given in 
(6, Theorem 5.1). 


Example. If m = 1, = 3, we take x = 2 and obtain a design for 53 
varieties in blocks of 13 by adding the integers modulo 53 to the initial block 


(1, 16, 44, 15, 28, 24, 13, 49, 42, 36, 46, 47, 10). 
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THEOREM 3.2. Jf v = 4m(4\ — 1) + 1 = p* and if among the 2. — 1 
expressions 


4m 


x? —1 = x (s=1,2,...,2A— 1) 


there are } — a even and \ +- a — 1 odd powers of x, then the design with para- 
meters 


v= 4m(4—1) +1, 6 = mv, r = 4m, k = 4d; 
my = me = 2m(4X—1), AL =A—atl, Aw=At+a-—1; 
P;, P2 (cf. 3.1) 


can be constructed from the initial blocks 
(0, xt, titan =e ettt+er(O—2)) | 
where i ranges from 0 tom — 1. 


Proof. The differences involving the zero element are all distinct and cover 
the even powers of x once. Hence, from Theorem 3.1, A, = A —a+1, 
Ae =A+a-—1; also m, = nz = $(v — 1) = 2m(4A — 1). Pi and Pz are 
obtained as before, and have the same form. 

If a = 1, a completely balanced series is obtained. 


4. Series with 2am(2a\+1) + 1 = p*. 


THEOREM 4.1. Jf v = 2am(2ad\ + 1) + 1 = p*, and if among the exponents 
qs, where 


xf = x — | (s = 1,2,...,aA), 
the residue class of (1 — 1) modulo a is represented d, times, then the design with 
parameters 
v = 2am(2ad + 1) + 1, 6 = mv, r = m(2ad + 1), k = 2ad + 13; 
n, = 2m(2adrd + 1), AG; Ps, 
can be constructed from the initial blocks 


+ 2 + 4 +4ma*h 
(x™, x ad x am x ma ) 


,* ee, ’ 


where u ranges from 0 to m — 1. The p;} are the number of expressions 


gett — | = x? 


where z = k — j (mod a), and t ranges from 0 to (v — a — 1)/a. 


Proof. The differences are 


au+2arm+¢e au+am(2r+2ad\+1—23)+¢, 
x »x ° 


Let set i be the set of powers which are congruent to (i — 1) modulo a. There 
are a such sets; hence m, = (v — 1)/a = 2m(2a\ + 1). Also, since each ele- 
ment of set i occurs among the differences \, times, condition 2 of Theorem 2.1 
is satisfied. 
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The number of times that every element of set k occurs among the differences 

in (set i — set j) is p,;*; but (set i — set j) consists of elements of the form 
gttteut+i—1 a= geets—1 = thelial a= 1) = geets—its 
where 
xttei — 1 = x*, 
These elements are in set & if and only if 
j-—1l+2z2=k-—1 (moda), 
that is, 
z=k-—j (moda). 
The work of finding the p,,;* can be simplified by noting that 


x” - 2am ad+)—et aie (x** = 1) im taht )—e8 an ge tem(2ed+1)—e8 


that is, z = 2; (mod a). Hence we need the expressions 
x? = x! — ] 


only for ¢ = 0,1,..., (v — 1)/2a. The residue class of z corresponding to a 
given ¢ is represented twice if ¢ < (v — 1)/2a and is represented once if 
t = (v — 1)/2a. 


Example. Take a = 3, m = = 1; then v = bd = 43, r = k = 7, n; = m2 
= nm; = 14. From the equations in GF (43) 

3? — 1 =3'", 3°—1 = 3%, 3° —1=_3", 32%—1=3, 31° — 1 = 3%, 
313-1 = 3%, 377 ~ 1 = Bf 
we deduce that the g, are 22, 1, and 23, that is, 1, 1, and 2 (mod 3). Hence 
A = 0,A, = 2, A; = Ss 

To find the p,,*, take i = 7 (mod 3); p,,* is the number of expressions 
x**— 1 = x*, where z=k-—i (mod3), and ¢=0,1,...,7. From the 
preceding tabulation of these expressions, it is seen that z = 0 once for t < 7 
and once for t = 7; z = 1 three times; z = 2 twice. Hence 


Pir’ = Poo? = pas*® = 3; Pir? = Poo? = Pas’ = 6; Pru? = Poo! = pas® = 4. 
The remaining p’s can be obtained from the relations 1.4, 1.5, and 1.6; thus 


3 6 4 64 4 4 4 6 
=|6 4 4], Px=|4 3 6], Ps=|4 6 4}. 
4 4 6 46 4 6 4 3 


In case a = 1, there is only one residue class modulo a; hence all the q, fall 
in this single residue class and there is only one A, = \. The resulting series is 
the completely balanced Series B of (6). 


P 


- 


THEOREM 4.2. Jf v = 2am(2ad — 1) + 1 = p*, and if among the exponents 
Qs, where 


xt = ym — 1, (s=1,2,...,@A—1), 
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the residue class of (i — 1) modulo a is represented d, times, then the design with 
parameters 
v = 2am(2ay\ — 1) + 1, 6 = mv, r = 2Zamd, k = 2ad; 
n, = 2m(2ad — 1), Ac + ba; Pi 
can be constructed from the initial blocks 


(0, x™, gtien panda goaut2em(26h—2)) | 
where u ranges from 0 tom — 1. 


Proof. Since the class of zero is represented once by all differences involving 
the zero element, the frequency of occurrence of the residue class of zero among 
the g, is A; + 1. The p,,* are found just as in Theorem 4.1. 


5. Series with am(ak\+1) + 1 = p*. 


THEOREM 5.1. Jf v = am(ad + 1) + 1 = p‘, where a is odd, and if among 
the exponents q,, where 


ome | me (s = 1,2,...,a@A) 


the residue class of (i — 1) modulo a is represented d, times, then the design with 
parameters: 


v = am(ad\+1)+1, d = mv, r = m(ak+1), k = adA+1; ny = m(ad+1), Ag; Py 


can be constructed from the initial blocks 


au autam _au+2am _au+ma*hr 
(0, % x — ), 


where u ranges from 0 tom — 1. 


Proof. The proof is similar to that of Theorem 4.1; the P; can be found as 
before, except that here, since a is odd, there are no relations among the gq, 
to simplify the work. 


THEOREM 5.2. If v = am(ad — 1) + 1 = p*, where a is odd, and if among 
the exponents q,, where 


ont — 1 = x (s = 1,2,...,a — 2) 


the residue class of (i — 1) modulo a is represented \, times, then the design with 
parameters 


v = am(ad — 1) + 1, b = mv, r = amy, k = ad; mn; = m(ad — 1), Ag + 2b; P; 
can be constructed from the initial blocks 


(0, gs. aioe” waa gtetem(eh—2)' 
where u ranges from 0 tom — 1. 


Proof. The proof is similar to that of Theorem 4.2. 
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6. Series with ma + 1 = p*. To every element y’ of GF(p*), let there 
correspond m varieties 


yn, 2, re | Ya! 


Then designs with three associate classes can be formed by taking as first 
associates of any variety 


all varieties 


Vu (fi # fe); 


Vw (u # w); 
the third associates are all varieties 


Ve". 


Thus the first associates of a variety are all varieties giving rise to pure dif- 
ferences; the second associates are all varieties giving rise to zero mixed 


differences; the third associates are all other varieties giving rise to non-zero 
mixed differences. 


THEOREM 6.1. If ma +1 = p*, then the design with parameters 


v = n(ma + 1), b = 4mv(n — 1), r = am(n — 1), k = 2a; 
n; = ma, 2g = n — 1, my = ma(n — 1), Ay = (a — 1)(m — 1), As = ma, 
A; =a — 1; 
ma — 1 0 0 
P,= 0 0 n—l1 ) 
0 nm—1 (ma — 1)(n — 1) 


0 0 ma 
P;= 0 n—2 0 ’ 


ma 0 ma(n — 2) 


0 1 ma — | 
P;= 1 0 n—2 
ma—-1%n—2 (ma — 1)(n — 2) 


can be constructed from the initial blocks 
(xf, 2.00%, Pee =,8+O-be, SLun', Su", 2 ele Sagi tO— be), 


where i ranges from 0 to m — 1, and (u, w) are the 4n(n — 1) pairs of integers 
selected from 1 to n. 


Proof. The pure differences of the type (u, ~), arising from all blocks with 
a fixed u and w, are each repeated a — 1 times (6, Theorem 2.1). Since a fixed 
u occurs with m — 1 values of w, A; = (m — 1)(a — 1). The zero mixed differ- 
ences of the type (u, w) occur a times for 7 fixed, and ma times in all; hence 











376 D. A. SPROTT 


As = ma. The non-zero mixed differences of the type (u, w) occur A; = a — 1 
times (6, Theorem 2.1). Since there are n(n — 1) pairs (u, w), 


b = 4m(ma+1) n(n — 1). 
The number of first associates of a variety y,“ is the number of varieties of 
the form 
He": 
thus 2, = ma. Also, nz is the number of varieties of the form 


eo"; 


that is, m2 = n — 1; m; = ma(n — 1) = the number of varieties of the form 
ts 


Yo - 
Two first associates have the form 
a”. yu"; 
hence );;' is the number of varieties of the form 
te" (fi * fs, fe # fs), 


that is, ma — 1. Also, p12! = p13' = p22! = 0. We obtain p2;' as the number 
of varieties of the form 

Vu" (f: fixed); 
hence p23! = » — 1. The number of varieties of the form 


Yu" (u ¥ w) 


gives us p33! = (ma — 1)(m — 1). This completes P; and the matrices P; and 
P; can be found in a similar way. 


THEOREM 6.2. If ma + 1 = p*, then the design with parameters 


v = n(ma + 1), b = m(n — 1)v, r = m(a + 1)(m — 1), Rk =a +1; 
ny = am, n, = n — 1,n3 = ma(n — 1), Ay = (a — 1)(m — 1), Ap = O, As = 2; 
P,, P2, P; (cf. 6.1) 


can be constructed from the initial blocks 
Ga", a0, eeep safte-be, Ow), 


where i ranges from 0 tom — 1 and (u,w) are the n(n — 1) permutations of the 
integers from 1 to n, taken two at a time. 


Proof. Each pure difference of the type (u,u) occurs a — 1 times for w 
fixed (6, Theorem 2.1) and (m — 1)(a — 1) times in all. The non-zero mixed 
differences of the type (u, w) occur twice, since u and w can be interchanged; 
the zero mixed differences do not occur at all. The m, and P, are found as in 
Theorem 6.1 and have the same values. 





Le 
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COROLLARY 6.21. If we permit u = w, we obtain a series with parameters 

v=n(am+1), b= mnv, r=mn(a+1), k=a+1; n, (cf. 6.2), 

Ay = n(a — 1) +2,A2 = 0,A3; = 2; P, (cf. 6.2). 

Proof. Each pure difference of the type (u, u) arises a + 1 times from the 
blocks in which u = w. 


COROLLARY 6.22. Ifm = a + 1 in Theorem 6.2 and the block (0;, 02, . . . , Oe+1) 
is included twice in the set of initial blocks, we obtain a group-divisible design with 
parameters 


v= (a+ 1)(am + 1), 5 = {m(a + 1) + 2} (am + 1), r = ma(a + 1)42, 
k=a+l1; 
g=a+1,h =am+1; \y = a(a — 1), A: = 2, 


where g is the number of groups in the GD design, h is the number of treatments in 
a group. 

Proof. To combine the second and third classes of a PBIBD, the following 
conditions must hold: 

(1) Ae = Az, 

(2) pis? + Pis® = Pis® + pis’, 

(3) Poo? + Pas® + pos? + Par? = Poo? + Pas* + pos? + par’. 
These conditions are satisfied; so classes 2 and 3 can be combined to give a 
PBIBD with the stated parameters. For this design, 


nm; = ma, nN, = (ma + 1)(n — 1); 


ma — 1 0 0 ma 
P, = ( 0 ae P= (0 (ma + 1)(a — » 


Coro.iary 6.23. Jf nm = a + 1 in Corollary 6.21 and the block (0, 02, . 
0.41) is included twice in the set of initial blocks, we obtain a GD design with 
parameters 

v= (am + 1)(a + 1),0 = {m(a + 1)? + 2} (ma + 1), 7 = (a + 1)*m4+2, 

k=a+l1; 

g=a+1,h=am+1; A, = a?+1,A2 = 2. 


Proof. Similar to that of Corollary 6.22. 


CoROLLARY 6.24. If in Theorem 6.2 the initial blocks are replaced by 
eo pO.) = we GO 1,... =f) 

where (u, w) are permutations of the integers from 1 to n, taken two at a time, then 
the resulting design is a GD design with parameters 

v = n(am + 1),b = m(n — 1)v,r = (a + 2)(m — 1)m,k = a+ 2; 

g=n,h=am+1; A. = (@+ 1)(m—1),A. = 2 

Proof. Here each zero mixed difference of the type (u, w) also occurs twice; 
hence classes 2 and 3 can be combined as in Corollary 6.22. 
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Example. Take a = 2, m = 1, in Corollary 6.23; the GD design has para- 
meters v = 9, b = 33, r = 11,k = 3; g = kh = 3; Ay = 5, Ap = 2. The initial 
blocks are (1,2,0), (1, 2,3), (1, 2,6), (4,5,0), (4, 5,3), (4, 5, 6), (7, 8,0), 
(7,8,3), (7,8,6), (0,3,6), (0,3,6), where we set y: = y, v2 = y + 3, 
¥s = y + 6. The three groups of three varieties are 0, 1, 2; 3, 4, 5; and 6, 7, 8. 
Thus, for instance, 3 occurs five times with 4 and 5 and twice with all the other 
varieties. 


7. Series with 2m(2A+1) + 1 = p*. 
THEOREM 7.1. Jf 2m(2\ + 1) + 1 = p*, then the design with parameters 


v = {2m(2A + 1) + 1}n, 6b = 3(m — 1)mv,r = m(2d + 1)(n — 1), 
k = 4. + 2; 

nm, = 2m(2. + 1), 22 = 2 — 1,m3 = m,(m — 1),A1. = (mn — 1)A, 

Ae = m(2X + 1), As = A; 


n,—1 0 0 
r=( 0 0 n—1 ) 
0 n—1 (nm, — 1)(m —1) 
0 0 1 
P,=|0 n-—2 0 ) 
ny O ,(n — 2) 


0 l m,—1 
P; = 1 0 n—2 
m—1l n—2 (nm, — 1)(n — 2) 


can be constructed from the initial blocks 


f t t i i i ) 
(xy » Xy —, see My a, XwsXw =. oe 9 Xy +o), 


where i ranges from 0 to m — 1 and (u, w) are the $n(n — 1) pairs of integers 
selected from | to n. 


Proof. By (6, Theorem 3.1), the pure differences of the type (u, u) occur 
X times for a fixed w and \; = (m — 1)A times in all. Similarly, the non-zero 
mixed differences of the type (u, w) occur A; = A times while the zero mixed 
differences of the type (u,w) occur (2A + 1‘ times for a fixed i, that is, A» 
= m(2 + 1) times in all. The method emvloyed in Theorem 6.1 gives the 
values for the , and shows that the P; have ihe same form as in that theorem, 
with ma replaced by 2m(2d + 1). 


Example. m = 2, = 3, = 1, gives a design with v = 39, b = 78, 
r= 12,k = 6; n,= 12, mz = 2, mz = 24, A, = 2, A. = 6,A3 = 1; 


li 0O 0 0 O 12 0 1 11 
P, = 0 0 2 ’ P, = 0 l 0 ’ P;= l 0 l ° 
0 2 2 i2 0 12 11 1 11 
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The initial blocks are 
(2°, 2,,**4, 2,,°**, 2%, ase, 2..°**), 
that is, reduced modulo 13, 
(i+ 1) (ly, 3u, 9... lo Seu» 9.), 1 = 0 or 1. 


The pairs (u,w) are the pairs (1,2), (1,3), and (2,3). Setting y, = y; for 
y = 0,1,...,12; ye = y + 13; ys = y + 26; we find that the initial blocks 
are 


(1, 3, 9, 14, 16,22), (1,3, 9, 27, 29,35), (2,6, 5, 15, 19, 18), 
(2, 6, 5, 28, 32,31), (14, 16, 22, 27, 29,35), (15, 19, 18, 28, 32, 31). 


The other blocks are generated by addition modulo 13; thus the first block 
generates (2, 4, 10, 15, 17, 23), (3, 5, 11, 16, 18, 24),..., (0,2, 8, 13, 15, 21). 
Given any variety, say 1, its first associates are 0, 2,3,..., 12; its second 
associates are 14 and 27; the remaining varieties are third associates. 


THEOREM 7.2. If 2m(2\ + 1) + 1 = p*, then the design with parameters 


v = {2m(2. + 1) + 1}n,b = (nm — 1)mv,r = m(2d + 2)(n — 1), 
k = 2d wee 2; my (cf. 7.1), At = (n = L)A, Ae = 0, As = i: P, (cf. 7.1) 


can be constructed from the initial blocks 
Ga, ™, .. 2 op Me™, @) 


where i ranges from 0 tom — 1 and (u, w) are the n(n — 1) permutations of the 
integers from 1 to n, taken two at a time. 


Proof. Proceed as in Theorem 6.2. 


Using proofs similar to those used in the Corollaries to Theorem 6.2, we 
obtain 


CoROLLARY 7.21. If we permit u = win Theorem 7.2, we obtain a series with 
v = {2m(2\ + 1) + 1}n, b = mnv, r = mn(2d + 2), k = 2A 4+ 2; 
n, = 2m(2\+1), mz = n—1, m3 = ny(m—1), Ay = MA+1, Ao = O,A;3 = 1. 
COROLLARY 7.22. If in Theorem 7.2 we set n = 2\ + 2 and include the block 
(01, O2, . . . , Oor42) 
among the initial blocks, then we obtain a GD design with parameters 


(2A + 2) {2m(2d + 1) + 1}, 


ll 


Vv 


b = {m(2d + 2)(2d + 1) + 1} {2m(2a + 1) + 1}, 
r= (21+ 2)(2\+ 1)m+1, k = 2.42; 
g = 2.+2, h = 2m(2. + 1) +1; i = AQAA 4+ 1D), A = 1. 
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CoROLLARY 7.23. If in Corollary 7.21 we set n = 2\ + 2 and include the 
initial block 


(0;, 02, . . . , Orr+2), 
then we obtain a GD design with parameters 
vy = (2A + 2){2m(2A + 1) + 1}, 
b = {m(2d + 2)? + 1} {2m(2d + 1) + 1}, 
r = (2X + 2)*m™+1,k = 21+ 2; 
g = 2\+2,h = 2m(2A + 1) +1; Ay = (2A + 2)A+1,r = 1. 


THEOREM 7.3. If 2m(2X — 1) + 1 = p*, then the design with parameters 


v = {2m(2\ — 1) + 1}n, b = 4mo(n — 1), 7 = 2m(n — 1)A,k = 4; 
= 2m(2\ — 1), m2 = n — 1,3 = 1,(n — 1), 


Ar = (m — 1)A, Ao = 2md, As = A; 


nn; — 1 0 0 
r=( 0 0 n— 1 ) 
0 n—1 (nm; — 1)(m —1) 
0 0 nN; 
r=(0 n—2 0 ) 
nN, 0 n(n — 2) 


0 1 m,—1 
P;= ] 0 n—2 
my —-1 nm—2 (mn; — 1)(m — 2) 


can be constructed from the initial blocks 


2 
| 


' ; ‘ - t ‘ 1+4(A—1 
(0,, Xu eee. ocog te +400 “a 4 ee ie me) 


where i ranges from 0 to m — 1 and (u, w) are the 4n(n — 1) pairs of integers 
selected from 1 to n. 


Proof. Proceed as in Theorem 7.1. 


THEOREM 7.4. If 2m(2\ — 1) + 1 = p*, then the design with parameters 


v = {2m(2A—1)+1}n, b = mv(n—1),7r = (2A+1)m(n—1), k = 2A+1; 
n, (cf. 7.3), A. = (m — 1)A, Ao = 2,A3 = 1; P;, (cf. 7.3) 


can be constructed from the initial blocks 
(0., Se", 2,79, seec®g 270, 0.), 


where i ranges from 0 to m — 1 and (u,w) are the n(n — 1) permutations of 
the integers from 1 to n, taken two at a time. 


Proof. Proceed as in Theorem 7.2. 


-—— 
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A CLASS OF ALGEBRAS WITHOUT UNITY ELEMENT 
R. M. THRALL 


1. Introduction. In a study of the commuting algebra of tensor space 
representations of the orthogonal group W. P. Brown encountered a class of 
algebras for which the existence of a unity element was equivalent to semi- 
simplicity, but which were of interest whether or not semisimple. He gave these 
algebras the name generalized-total matrix algebras and proved (2) that each 
such algebra was characterized by three integers /, 7, m and was isomorphic 
to the algebra of all square matrices of degree r + / + m which have zeros 
in the first / rows and in the last r columns. 

Let F be any field, let K be an extension sfield of finite degree k over F, let 
m be a positive integer and let /,r be non-negative integers. We denote by 
C = C(K,m,l,r) the F-algebra of order k(m + 1)(m + 1) consisting of all 
K-matrices having zeros in the first / rows and in the last /-columns. We call 
C a submatrix algebra. 

In the present paper we introduce a new family of algebras called algebras 
of class Q. These algebras are defined in terms of certain simple properties 
possessed by submatrix algebras. Our main result is a proof that each algebra 
of class Q is a factor algebra of a direct sum of submatrix algebras. We also 
touch on the topics of automorphisms, isomorphisms, and representations, 
of algebras of class Q. 


2. Algebras of class Q and class Q’. An F-algebra A (of finite dimension) 
is said to be of class Q if there exists an idempotent ¢ in A such that the fol- 
lowing three conditions hold: 

(Q:) B = «Ae is semisimple, 

(Q:) AeA = A, 

(Q;:) A = B+ N, where N is the radical of A. 

If instead of (Q:) we have the stronger condition 

(Q:’) B = «Ae is simple, 

then we say that A is of class Q’. 


It is easy to see that every submatrix algebra C(K, m, 1,1) is of class Q’; 
for we may take as the idempotent the matrix 


|| 0 0 0] 
(1) é=|0 I, 0 ! 
10 0 Of 


where the partitioning of rows and columns is given by /, m, r. 


Received October 12, 1954. Part of this research was carried out under a contract of the 
Office of Ordnance Research with the University of Michigan. 
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THEOREM 1. Let A be an algebra of class Q. Then ANA = 0. 


Since no non-zero element in a semisimple algebra can be in the radical we 
note that B (\ N = {0}; hence the sum B + N is direct (in the vector space 
sense). Now multiply A on left and right by ¢ and we get from (Q;) that 


(2) eNe = 0. 
Finally, we have 
ANA = AtANA&€A C AtNeA = 0. 
CorROLLarY 1. If A is of class Q then N* = 0 and AN? = N°A = 0. 
THEOREM 2. If A is of class Q then 


(3) A=B+c«N+ Ne+ N* (direct sum). 


We establish the theorem by identifying (3) with the Pierce decomposition 
of A relative to «. By (Q:), B = ¢Ae consists of all elements of A having « as 
two-sided unity. Next, suppose that a is an element of A for which a = ea and 
ae = 0. According to (Q;) we can write a = 6 + » where 8€ B and 7 € N. 
Now since a = ea we must have ey = », and then ae = 8 + ene = 8 requires 
8 = 0. This shows that ¢«N contains all elements of A having « as left unity and 
right annihilator; moreover, it follows from (2) that each element of «N has 
this property. We show similarly that Ne consists of all elements of A having 
« as right unity and left annihilator. 

This shows that the Pierce decomposition is 


(4) A=B+c«N+ Ne+ No (direct sum) 
where N, consists of all elements of A having ¢ as two-sided annihilator. All 


that remains is to show that Ny = N?®. It is a consequence of Corollary 1 that 
N? C No. Next, it follows from (Q2), (4), and the fact that N is an ideal that 
(5) A=A-A = (B+ Ne)(B+ €N) 
=B+ Ne+ «N+ NeN, 
where NeN C N? C No. Comparison of (4) and (5) then shows that NeN = No 
and hence 
N? = NeN = N,. 


3. Structure theory. In this section we show how a general algebra of 
class Q can be built up from algebras of class Q’ and in the following section we 
study the structure of algebras of class Q’. 


THEOREM 3. Let A, A;, and Az be algebras of class Q and let M be an ideal 
(two-sided) in A. Then the direct sum A, ® Az and the residue class algebra 
A — M are both of class Q. 


If ¢€, €1, €2 are the postulated idempotents in A, A;, Ax, respectively, then 
(€1, €2) and « + M, respectively, are idempotents which satisfy (Q:) and (Q2) 
in A, + Az:and A — M. It is also easy to check (Q;) in both cases. 
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THEOREM 4. Let A be an algebra of class Q. Then A = A* — M* where A* 


is a direct sum of algebras of class Q’ and M* is contained in the square of the 
radical of A*. 


It follows from Theorem 3 that A* — M* is of class Q if A* and M* satisfy 
the conditions of Theorem 4. 

Let A = B+ N where B and N are related to A as in (Q;), (Q2), (Qs), 
and suppose that 


is the (unique) expression of B as a direct sum of simple subalgebras. Let €, 
be the unity element of B, (¢ = 1,..., ); then 
(7) e=at...t¢¢ 


is an expression of ¢ as a sum of orthogonal idempotents in the centre of B. 
Now set 


(8) A,=AegA 6 @ i,..«ofh 
and 
(9) N,=A:°1\N (¢ = 1,...,p). 


LEMMA 1. A, is of class Q’ with idempotent «,, simple summand B,, and 
radical N,. 
The equation 
€:A,y €, = €;(AeA)e, = €eAMeeeAce, = €,Be,Be, = B,B,; = B, 
verifies (Q,’). 


Next, we have 
A; =) A.A; = A¢,AeAe,A — A¢é,A = Aw 


and hence A; = A ,€;A,. This verifies (Qz2). 
Finally, 


A, = (B+ N)e(B + N) = Be,B + (Ne.B+ Ne.N + Be,N) CB, + N, 
since A, and N, are ideals in A. But B; C Ay, N; C Ay, hence 
(10) A, =B,+N,. 


This sum is direct (in the vector space sense) since ¢, is unity element for 
B, and ¢,Nye;, € e,Ne, = €eNee, = 0; it follows that A, — N,=>B,, and 
hence the N, is the radical of A,. This establishes (Q3). 


LemMMA 2. A,A, = 0 tf i #j. 


If i # j we have 

¢Aee=e,Be,+eNe, 
Beg es + €, eNe €; 
=0+0; 


hence A ,A, = Ae, AAe,A = (). 


ll 








€4 
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id 
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We are now ready to prove Theorem 4. Let A* be the (ring) direct sum of 
Ai,...,Ap, i.e., A* consists of all p-tuples a* = (a:,...,a,) with a, in A, 
and with addition and multiplication done componentwise. Consider the 
mapping 
(11) T :a* = (a1,...,a)) ~a=ayt+...+ a). 
of A* into A. 

This mapping is clearly a linear transformation. It follows from Lemma 2 
that it is a ring homomorphism. It is “‘onto’’ since 


A*T =Ai+...+A,= Ag At+...+AegA =AA =A. 
Let M* be the kernel of 7, and let N* be the radical of A*. All that remains is 


to show that M* C (N*)*. Suppose that a* = (a:,...,«a,) lies in M*; 
(12) a=a+...+a =0. 
It follows from (3) that we can write 
(13) a= Be tatoact rs (¢=1,...,) 
where 8, = €; a; €4 4 = €4 4% f; = fo, e, and m, e; = ecfe = €¢ 7, = Tr ey = O. 

Now €,@ €; = By, €,a@ = By + 94, ae, = By + £;, and since a = 0 this gives 
By =n =f, = 0 (¢ = 1,...,p) and hence 

a® =m (7,..., T,) 


which is in (N*)*. 


4. Structure theory (continued). Theorem 4 gives the structure of algebras 
of class Q in terms of algebras of class Q’. In this section we refine this result 
by an analysis of algebras of class Q’. 


THEOREM 5. Every algebra of class Q’ is the homomorphic image of a sub- 
matrix algebra. The kernel of the homomorphism is contained in the square of the 
radical of the submatrix algebra. 


Let A be an algebra of class Q’ with simple summand B = ¢Ae. According 
to Wedderburn’s Theorem (1), B is isomorphic to a total matrix algebra over 
a finite extension sfield K of F. Let {x;,...,«,} be a basis for K over F, let m 
be the degree of B over K, and let e,, (i,7 = 1,...,m) be a matrix unit 
K-basis for B. 

Let ¢ = €,,; then K is isomorphic to K’ = eBe and every irreducible left-B- 
space W is isomorphic to eB. In particular, eW + 0 and there exists a vector w 
in W such that {w(= e:: w), ¢21 W,..., mi W} is a K-basis for W. Then the 
mk vectors , é€én w (kh = 1,...,k;%4 = 1,...,m) form an F-basis for W. 

€N is a left B-space and as such is a direct sum of irreducible left B-spaces. 
Hence, there exist vectors 1, ..., 7; in eN such that 


(14) {méng, (A= 1,...,8:¢21,...,m;8se1,...,8} 


is an F-basis for eN. 
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Similarly, there exist vectors {1, ..., ¢, in Ne such that 
(15) {fs ey Kn (A=1,...,k:;5f7 = 1,...,m;¢=1,...,7)} 


is an F-basis for Ne. 
Finally, N? = NeW is spanned by all products of basis vectors for Ne and 
€N;; i.e. by all products of the form 


04 1g Kn Kn C41 Ms- 


Next, we observe that the matrix units e,, commute with elements of K, 
hence 1; k, kx € is equal to zero unless 1 = j and then it is equal to ¢ xp xy e. 
Now {,¢ = {, and en, = 9,; hence N? is spanned by the kir products 


(16) {S,a9, (A=1,...,k3521,...,);¢=21,...,7f)}. 


In general these products will not be independent but will satisfy certain 
linear conditions 


(17) 2 aclt, h, s) fixan, = 0 (q = 1,...,g). 


We may suppose that these conditions are independent; then the F-dimension 
of N? is kir — g. 

We now let C = C(K,m,l,r). Let ¢ be the matrix given in (1), and let 
C = B’ + N’ be the decomposition (Q;) for C. Then we can choose matrix 
units ¢,,;/ for B’ and elements my’, . . . , 9; in é1:' N’, f1',..., £7’ in N’ e1:’ such 
that the k(m + 1)(m + r) elements 


(18) Knlptgs Kne'sMer S1e'rgk ny Seen 
BD @ Boo BES @ 1,20 MGS S 1,20 og BEB 1,2 0 


form an F-basis for C. Then the unique linear transformation of C onto A 
which sends each of these basis vectors into the corresponding unprimed ele- 
ment of A is clearly a ring homomorphism whose kernel lies in (N’)*. This 
completes the proof of Theorem 5. 

Next we combine the results of Theorems 4 and 5 and get our main structure 
theorem. 


THEOREM 6. Let A be an algebra of class Q. Then A = A* — M* where A* 


is a direct sum of submatrix algebras and M* is contained in the square of the 
radical of A*. 


5. Representation theory. Let A be an algebra of class Q and let V be a 
(left) representation space for A. Consider the chain V DAV D NAV 
> ANAV = 0. Clearly both the spaces V/A V and NA V are annihilated by 
every element of A, and AV/NAV is a completely reducible non-degenerate 
A-space. Hence, by suitable choice of basis vectors we get the following matrix 
form for the representation: 





wv Pe 








= 


et 
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0 0 0 
Vai (a) Vala) 0 
Vala) Vala) 0 


where V2 is in completely reduced form. We may suppose the basis elements 
so chosen that V22(e) is the identity matrix, and ifa = 68++{+7risa 
splitting of a according to (3) then 


Veo(a) = Vo2(8), Voila) = Vai(n), Va2(a) = Vae(f), 


(19) a- Via) = 














and 
Vai(a) = V(r). 


It is easy to show that if any of the integers /,, r, defined by the ideals A, 
of A given by (8) exceeds unity, then A has unbounded representation type 
(3). Consequently, it is not likely that there is any simple classification of the 
indecomposable representations of algebras of class Q. 


6. Uniqueness and automorphisms. Since the structure theorems depend 
on the decomposition A = B + N it seems desirable to study its uniqueness. 
Since N is the radical any lack of uniqueness must come from the semisimple 
summand B. But since B = Ae is uniquely determined by ¢ any second decom- 
position must correspond to a second idempotent ¢’. 

It is easy to verify for any m0, fo in eN, Ne, respectively, that 


(20) e = €+ no t+ fo + Som = (€ + fo) (€ + m0) 


is an idempotent for which Q;, Q2, and Q; hold. Moreover, if either 9 or fo is 
different from zero, B’ = ¢’Aé’ is not the same as B. Hence all we can expect 
for B is uniqueness to within an automorphism and this is established in the 
following theorem. 


THEOREM 7. Let A be an algebra of class Q and let ¢, e' be idempotents for 
which Q,, Q2, QO; hold. Then there is an automorphism T of A which sends « into ¢’. 
More precisely, if a = 8 -++{5-+ 1 is a splitting of a according to (3) then 
the mapping 

T:a-aT =a = Bl + ent+ite +r 
is an automorphism of A which sends « into ¢’. 


Both ¢ and ¢’ are mapped into the identity element of A — N under the 
natural mapping; hence ¢’ — ¢ is in N and so from (3) we get 


e = €+ m0 + £0 + 70. 


Now ¢ = (e’)? = € + 90 + fo + £070; i-e., ¢ has the form (20). Note in 
particular that ec’e = «. 
Next, we have 


eT = eee’ = (€ + £0) (€ + mo)e(e + fo) (€ + m0) 
= (e+ fo)(e + m0) = €. 
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Clearly T is a linear transformation and a direct computation shows that 
(a; a2)T = (a; T) (a2 T); ie., T is an endomorphism on A. 

To complete the proof that T is an automorphism, i.e. that it is one-to-one 
and onto we construct its inverse. Let a = 6’ + n’ + {’ + 7 be the splitting 
(3) for a relative to ¢’ and let aT’ = «8’e + en’ + {’e + r. Then the equations 
TT’ = T’T = I follow from ee’ec = € and eee’ = &’. 

The automorphism T of the theorem is completely defined by ¢«’ and hence 
by mo and fo; we denote it by T (mo, fo). It is easy to verify that the set W of all 
T(n, {) is a commutative group with composition rule 


(21) T (m1, £1) - T (ne, £2) = Tm + 2, £1 + £2). 


Let U, denote the subgroup consisting of all automorphisms of A which leave ¢ 
fixed; then the group G of all automorphisms has the factorization U.W. 

Let y, 7’ be elements of B for which yy’ = y’'y = e. Then the mapping S(y) 
given by 
(22) a=B+n+i+7r—-aS(y) = Br t+y¥at+ivt7 


is an automorphism of A. The set V of all S(y) is an subgroup of U. We 
observe that N? is elementwise fixed under the automorphisms in V and in W. 

According to Theorem 6, the general question of conditions for isomorphism 
of algebras of class Q can be reduced to the study of conjugacy (under auto- 
morphisms) of ideals contained in the square of the radical of a direct sum of 
submatrix algebras. 

In this paper we shall limit our study of isomorphism to the case of algebras 
of class Q’ and in particular those for which K = F, i.e. for which B is a total 
matrix algebra. According to Theorem 5 we can reduce this to the following 
question. Let C = C(F,m,l,r), and let M, M’ be two ideals in the square 
of the radical of C. We ask for necessary and sufficient conditions for the 
conjugacy of M and M’ under automorphisms of C. We first determine the 
group G of automorphisms of C which leave F elementwise fixed. 

We have initially the factorization G = U.W. Let Uz denote the subgroup 
of G consisting of automorphisms which leave B elementwise fixed and let T 
be any element of U,. Then since B is a total matrix algebra T must agree on 
B with an inner automorphism of B, i.e., there exists y in B such that T(y)~ 
leaves B elementwise fixed (1). This shows that U. = U,V. Now since W 
and V both leave N? elementwise fixed, M and M’ will be conjugate under G 
if and only if they are conjugate under Uz. 


THEOREM 8. Let G be the group of automorphisms of a submatrix F-algebra 
C = C(F, m, l, r). Then G has the factorization 


G = Uz; VW 


where W is isomorphic to a vector space of dimension m*lr over F, where V is 
isomorphic to the full linear group GL(m), and where Us is isomorphic to the 
direct product of GL(l) and GL(r). 
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The statement about W follows from (21) and the fact that for C we have 
dim «N = lm and dim Ne = rm. The statement about V follows from (22) 
since GL(m) is the group of inner automorphisms of a total matrix algebra of 
degree m. 

Next, let T be any element of Uz, and choose ¢ as in the proof of Theorem 5. 
Then, since e is in B, eT = e and hence (eN)T = eN and (Ne)T = Ne; more- 
over, T induces non-singular linear transformations JT, on eN and T, on Ne. 
Let r be an element of N*. Then we have (cf. (16)) 


(23) es i» Gish Ms- 

ts 
(The factor x, appearing in (16) does not appear here since K = F.) Now, 
(24) tT = 2) au($ Te) (nT 2); 


hence T is completely determined by 7, and Tx. It follows that the mapping 
T — (T,, Tg) is a homomorphism of Us, into GL(l) X GL(r). Moreover, it 
follows from (24) that if T— (J,,J,), then T = J, i.e. this mapping is an 
isomorphism. Finally we show that the mapping is onto. Let T,, JT, be any 
elements of GL(l) GL(r) respectively. Then the mapping T = 7(T z, Tp) 
defined by 

(25) (B+atetr)T =8+07,.+ fTe+7, 


where 7’ is given by (24), is clearly an element of Uz which maps into (T,;, Tx). 
Now let M be an ideal of C contained in N?, and let M have dimension g 
over F. Then (cf. (17)) M has a basis of the form 


(26) it, = De alt, s) Sm (qg=1,..-,8)} 


where the a,(t, s) are elements of F. We can associate M and the given basis 
with the trilinear form 


Sua: fu(x, y,2) = pe Ag (t, S) XV 


A change of basis for M replaces fy according to a non-singular linear trans- 
formation on the x,. Under an automorphism 7(T,, Tz), M is replaced by a 
new ideal M’ whose corresponding trilinear form is obtained from fy by apply- 
ing the substitutions 7, to the y, and 7, to the z,. Thus we see that ‘wo ideals 
M and M' are conjugate under Us, and therefore under G if and only if their 
corresponding trilinear forms f and f’ are equivalent. Thus the problem of iso- 
morphism of two algebras of class Q’ (having K = F) is reduced to the equi- 
valence of trilinear forms. 

To extend this result to the case where K # F would involve equivalence 
of quadrilinear forms under the full linear group on three of the sets of vari- 
ables and under a finite group corresponding to automorphisms of K over F 
on the fourth set of variables. If the centre of K is inseparable over F, then 
Theorem 8 still remains valid except that Ug, must be enlarged to account 
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for automorphisms of K. The factorization G = UgVW is no longer valid 
(unless the centre of K is inseparable over F). We leave the detailed analysis 
of this case as well as the general problem of isomorphism of algebras of class Q 
for future treatment. 
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ON THE MODULAR REPRESENTATION OF THE 
SYMMETRIC GROUP 


PART V 
G. pe B. ROBINSON 


1. Introduction. It has been observed (2) that the number of p-regular 
classes of S,, i.e. the number of classes of order prime to p, is equal to the 
number of partitions (A) of m in which no summand is repeated p or more 
times. For this relation to hold it is essential that p be prime. It seems natural 
to call the Young diagram [A] associated with (A) p-regular if no p of its rows 
are of equal length, otherwise p-singular. 

The problem considered here is that of refining the above result to prove 
(§5) that the number of regular diagrams in a given block is equal (1; 2; 4; 5; 6) 
to the number of modular irreducible representations (indecomposables of the 
regular representation of S,) in that block. It is interesting to see that all our 
machinery is required. For example, the notion of an r-Boolean Algebra 
associated with a given diagram (8), which seemed somewhat of a curiosity 
at first, plays a central role. In particular, complementation in such an rBA 
can be interpreted in terms of the core and so has significance for the block as 
a whole (§§2, 3). Similarly, the construction of a diagram with a given core 
from a knowledge of its p-quotient (star diagram) (3,7) has to be made 
explicit (§4). This shows up the underlying number-theoretic basis of the 
theory in a new and significant light. 

Actually, we are laying the foundation here for the establishment of an 
explicit correspondence between the indecomposables of the regular represen- 
tation and the modular irreducible representations of S,. The existence of this 
correspondence for any finite group was demonstrated by R. Brauer and 
C. J. Nesbitt in 1937, using very general arguments. 


2. The complement of a Young diagram in its r-Boolean Algebra. Consider 
a Young diagram [A] to (from) which can be added (removed) d(d*) nodes of 
class r. Such a diagram belongs to an r-Boolean Algebra (8) of dimension 
d + d* in which the complement of [A] is obtained by removing the d* r-nodes 
and adding r-nodes in the d free r-positions of [A]. Let us denote this uniquely 
defined complement by [XK]. As was shown in (8) 


2.1 d — d* = 4, 


so that the core [Xo] of [XK] is obtainable by adding 6 r-nodes to [Ao] where 6 
is the r-defect of [Ao]. Also, the weight of [X] is the same as that of [A]. Two 
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diagrams of the same block may belong to different rBA’s and d and d* may 
be different, but 6 is the same in each case. We prove the following theorem 


(cf. (8, 4.4)): 


2.2 The p-quotients of [\] and its complement [i] in the appropriate rBA are 
the same except for the interchange of the r and (r — 1)-constituents. 


Proof. We distinguish three cases. 

(i) If the ith row of [A] ends in a removable r-node and the jth column in an 
r-position (8) then h,,; = 0(mod p). Clearly, removing the r-node at the end 
of the ith row and adding an r-node at the foot of the jth column does not 
change the (i, j)-hook length so that h,, = h,, = 0 (mod p). It does however 
change the residue class of the hook from r to r — 1. Similarly, if the ith row 
ends in an r-position and the jth column in a removable r-node the length of 
the (i, 7) hook remains the same but its class is changed from r — 1 tor. 

(ii) If the ith row of [A] ends in a removable r-node and the jth column in an 
(r + 1)-node below which no r-node can be added as in Figure 1, 


6)....[=IT] 


rifr+l1 
Fic. 1 
then removing the r-node yields h, 1 = hy, = 0 (mod p). Similarly, if the 


th column ends in a removable r-node and no r-node can be added at the end 
of the ith row as in Figure 2, 














GZf...le—1 








r+1 


r 














Fic. 2 


then removing this r-node yields h,1,, = h;,, = 0 (mod p). Conversely, we 
may think of adding an r-node in the appropriate r-positions in Figures 1 and 2 
to yield similar conclusions. 

(iii) In this third case we have to consider (i, 7)-hooks such that mo r-node 
can be added or removed from the end of the ith row or the jth column. The 
situation is as indicated in Figure 3. 
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Taken in conjunction with case (ii) it is clear that if the r-constituent of 
[A], receives a contribution from h,, = 0 (mod p) in case (iii), then also the 
(r — 1)-constituent of [X], will receive a contribution from h441,,-1 = 0 (mod p) 
and vice versa. All three cases are concordant, in that an h = 0 (mod p) remains 
fixed in case (i), or moves one place in its row or column in case (ii), or diagon- 
ally in case (iii), so that the r and r — 1 constituents of [A], and [X], are 
interchanged, the others remaining unaltered. 


2.3 Example. If [A] = [7, 5, 4*, 2?, 1°] for p = 3, r = 1, thend = 2,d* = 2 
so that 


[AJs = (3%), (1), —, 


where the constituents of [A]; are associated with the residue classes 0, 1, 2 
respectively. We have [i] = [8, 6, 4*, 2, 1°] and 


[X]s = [1], [3*], —. 
Each of the changes described in cases (i), (ii), (iii) is illustrated. 


3. Regular Young diagrams. Consider the 0-element [A°] of an rBA for 
which d* = 0 and let us think of adding an r-node in each of the possible 

= § r-positions (8), by 2.1. It is possible that the addition of an r-node will 
lead to a singular diagram; if so, we shall call the corresponding r-position a 
singular position. 

Corresponding to a given singular position P there exists a regular position 
P’ at which an r-node can be added as indicated in Figure 4, the resulting 
diagram being regular. 























r r+1| 
r—1 Pr’ 
p — 1 rows 
r r+1} 
r—l Sd 











Fic. 4 
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Of course P’ may itself be a singular position as in the Example 3.1 (whose 
corresponding regular position is P’’ = (3, 3)). But the sequence of singular 
positions must eventually yield a regular position, since adding an r-node in 
the first row of [A] cannot yield a singular diagram. 

Clearly, adding all 6 r-nodes to [A°] yields its complement [A‘], and if [\°] is 
regular then so also is [A‘], since all singular positions and their corresponding 
regular positions are filled. A similar argument shows that [A‘] is singular if 
[A°] is singular, and conversely. 

When we consider an arbitrary element of an rBA for which 0<d* <d+d* =/ 
the situation is somewhat different, since if a regular position is occupied by an 
r-node, then in the complement the corresponding singular position will be 
occupied and the diagram will be singular. To overcome this difficulty we 
define a modified complement This definition favours regular diagrams, but a 
similar one would favour singular diagrams. 


DEFINITION. If in a regular diagram [A], a singular r-position is vacant 
while its corresponding regular position is occupied by an r-node, then the 
modified complement of [A] in the appropriate rBA is that diagram obtained 
from [i] by raising all r-nodes which occupy singular positions to the corre- 
sponding regular positions. Clearly, the modified complement of a modfiied 
complement is the original diagram. 


3.1 Example. If [A] = [7, 4,3, 2, 1°], then in the complement for p = 3, 
r = 0, [X] = [6, 5, 2°, 1?] the two singular positions (5,2) and (7,1) are 
occupied and the regular position (3, 3) is vacant. Thus the modified comple- 
ment is obtained by raising these 0-nodes to the corresponding regular posi- 
tions to yield (6, 5, 3, 2?, 1]. 


We may think of complementation in the ordinary sense as a special case 
of modified complementation, and so state the following theorem: 


3.2 The property that a diagram be regular is invariant under modified 
complementation in the appropriate rBA. 


We conclude that (cf. (7, 5.6)): 


3.3 The number of p-regular diagrams of a given weight w is independent of 
the core. 


Proof. Complementation or modified complementation amounts to adding 
5 r-nodes to the core. But we know that the result is again a core, and every 
core can be obtained in this way (8). 


4. The explicit construction of a Young diagram with given p-core and 
p-quotient. Our construction is explicit and merely reverses an argument 
given elsewhere (3). 

Let us call the set of first column hook lengths obtained from [Ao] a core 
set, denoting them 


4.1 3 Ci, Cay «2 = » Cry c;> Ci+1- 
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If these c’s are divided into residue classes, then it is known (8; 9) that the zero 
class is empty and that all residues smaller than the largest in a given class 
necessarily appear. 


It is in general necessary to extend I, in the following manner: 
4.2T, :b;=c14+5, be = Cots, ... de = Cots, begs = S—1,... 0-1 = 1,5, = 0, 


where k + s = h; we shall call I’, a basic set. Again we may divide the elements 
of I’, into residue classes, the zero class now appearing for h > k. 
We shall denote the p-quotient [A], by the set of disjoint diagrams 


4.3 [od], [,A], eees [1A], 


where one or more constituents may be vacuous. The partition corresponding 
to a given constituent may be written out in detail thus: 


4.4 [A] = [Aa rA2, ees Az], 


where r designates the residue class of the constituent. 

In the required construction of a diagram [A] we must extend the core set 
I, so that the quantities 
4.5 esd (¢ = 1,2,...,k,) 


can be added in order to the k, largest members in the appropriate residue 
class of T’,. It only remains to determine this residue class for all r, and we do 
this by setting 

4.6 r = b, — h (mod p). 


If we denote by g, the number of elements in a core set which are congruent 
to r (mod p), then the number of elements in the basic set I, which are con- 
gruent to r (mod ) is given by 





4.7 e(r.s) = eet [StP= t=" |, 
where the bracket function [x] denotes the largest integer equal to or less than 
x. For a given [A], we have a set of integers k, (r’ = 0,1,..., — 1), and 
the choice of s for a given core is determined by the following conditions: 
on alr, s) | >#e r’ % —h(mod p), 

| =k,,, r’ = —h(mod p), 


where r’ = r — h (mod p) by 4.6. That these conditions determine s uniquely 
(7) will be illustrated by the following 


4.9 Example. To construct [A] given [Ao] = [2, 1°], [A]s = [2, 1], [1?], [2] 
where the constituents of [A]; are associated with the residue classes 0, 1, 2 
respectively. Table I gives the values of the function g(r, s) and is arranged 
according to the residue classes r’ = r — h (mod p), for p = 3. 
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Table I 
g(r’, s) 

$s h 

ry’ =Q@ ry =2 ry =] | 
0 3 0 1 2 
1 4 0 2 2 
2 5 0 2 3 | 
3 6 1 2 3 
4 7 1 3 3 
5 8 1 3 4 
6 9 2 3 4 | 
7 10 2 4 4 | 
8 11 2 4 5 | 








The integers printed in bold type are those which correspond to the equality 
in 4.8. We can use the table to determine s. Clearly the k’s of [A]; are ko = 2, 
ke = 1, ki = 2, so that s > 6. That s > 6 follows from the equality part of 
4.8. Thus the basic set is 


10, 8, 7, 5, 4, 3, 2, 1, 0. 


Corresponding to r’ = 0 we must add 6 to 3 and 3 to 0; corresponding to 
r’ = 1 we must add 3 to 10 and 3 to 7; corresponding to r’ = 2 we must add 
6 to 8. Rearranging, we have the set of first column hook lengths: 


14, 13, 10, 9, 5, 4, 3, 2, 1, 
which belongs to [6?, 4?, 15]. 


5. The enumeration of p-regular diagrams. As we have remarked, a core 
set can have no element = 0 (mod p), and moreover every class of elements 
must contain every integer less than the largest in the class. If a diagram is 
singular then there will be at least p successive integers in the set of first column 
hook lengths. Conversely this condition is also sufficient for singularity. 


5.1 Amy diagram obtained by adding p-hooks to a p-core, while not increasing 
the number of rows, is necessarily regular. 


Proof. By adding multiples of p to a core set we can never introduce the 
zero residue class and so have p consecutive first column hook lengths. 

The enumeration problem of regular diagrams will be solved by showing 
how we can enumerate singular diagrams of a given weight w, provided the core 
is suitably chosen. Having established the enumeration in one case then, by 
3.3, it applies in all cases. The choice of the core affects the situation in a some- 
what subtle manner as we ca‘. see by examining first the case p = 2. Here 
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there is only one type of core and by taking the number of rows to be g we 
have the corresponding core set to be 


I, :2g — 1,2g — 3,...,3, 1. 


If we extend I, to I',,, where s is even it will always be possible to obtain a 
set of at least two consecutive terms provided w < g + 1, and the diagrams have 
non-vacuous [,A], where r = —g (mod 2). If w > g+ 1 then the diagram 
represented by 

[A] = [w], 


r = —g (mod 2), will have no two consecutive terms and so will be regular. 
Moreover, if s is odd all the terms in the extended set are even except some 
of those at the end so it will be necessary to add (from the top down) at least 
g + 2 multiples of 2 to obtain a set of first column hook lengths, i.e. w > g + 1. 
By denying singularity we obtain regularity so we have proved the following 
theorem: 


5.3 For p = 2, and w < g + 1 the necessary and sufficient condition that a 
diagram be regular is that in its 2-quotient the constituent [,A] be vacuous for 
—r =h = g (mod 2). 


For p > 2 the situation is complicated by the fact that there are p — 1 
classes of terms in a core set and some of these may be vacuous. Thus to 
produce a set of » consecutive first column hook lengths in the manner en- 
visaged above the weight w will be limited by the shortest residue class in the 
core set; we call the length g of this shortest class the grade of the core. If any 
class is vacuous, g = 0, and w = 1 is the only possible case yielding such an 
enumeration of singular diagrams. 

The following theorem includes 5.3 as a special case. 


5.4 If a p-core set T, contains p — 1 residue classes, each class containing 
at least g members, then the necessary and sufficient condition that a diagram 
[A1, Ao). -,Ae]l of weight w<g+1 be regular is that [,rA] be vacuous for 
—r =h=k (mod p). 


Proof. As before we fix attention on the singular diagrams. From our 
definition of the grade g, we know that there are at most g sets of p — 1 
consecutive residues in T',. The worst case we need consider is where [,A] = [w] 
so that s = p. Adding wp to zero we obtain I, which contains the consecu- 
tive set 


wp,wp —1,...,(w— 1)pt+1, 


so that [A] is singular. The other extreme case is where [,A] = [1”] and we add 
p to each of w terms which are all congruent to zero (mod p). In this case we 
have the consecutive terms 

wp, wp — 1,...,3, 2,1, 


and [A] is certainly singular. All other partitions of w clearly yield at least p 
consecutive terms in I,. 
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If the w nodes are distributed over more than one constituent of [A], then, 
provided [,A] is not vacuous, the above argument is still applicable and [A] 
must be singular. 

On the other hand if [,A] is vacuous, h = k (mod p), and since additions 
must be made at the top of a residue class of T',, w > g + 1 as in 5.3, proving 
that all diagrams under consideration must be regular. 

Denoting the number of partition of w, by p,, we combine 3.3 and 5.4 and 
conclude (1; 2; 4; 5; 6) that: 


5.5 The number of p-regular diagrams of weight w having a given core [Xo] 
is equal to 
p-l 
> Pw. Pw, -- + Dw,-, (Fw. -w0<w <u) 


and so equal to the number of modular irreducible representations of S, in the 
corresponding block. 


We prove in conclusion the following interesting result: 


5.6 Forw < g + 1 the diagrams in a given block are all zero elements (d* =0) 
of their respective rBA’s, where —r = k (mod p). 


Proof. if d* # 0 there is a removable node of class 7; this implies the pre- 
sence of a term of I’, congruent to zero (mod p), followed by a gap in the set. 
Since this does not happen for w < g + 1, with —r = k (mod p), we conclude 
that d* = 0 for every such diagram. 


Consider a core C,; of grade g having k rows. Clearly, the next succeeding 
node in the first column would be of class r where —r = k (mod p). If 5 such 
r-nodes are added to C, we obtain a core C2. Successive complementation in 
this manner yields a series of cores. For p = 3, 


(a) [2], [3, 1], [3, 17], [4, 2, 17], [4, 22, 1°],..., 
(b) zero, [1], [17], [2, 17], [2?, 17], [3, 27, 17],..., 
are two such series. We state the following lemmas without proofs. 
5.7 Two distinct series cannot have a core in common. 


5.8 The grade cannot decrease under complementation. 


It follows from 5.6 that if w < g+ 1 for diagrams having C; as core, then 
subsequent complementation with —r = k (mod ) is ordinary and the dia- 
grams so obtained have C2, C;, etc. as cores. Moreover, regular diagrams go 
into regular diagrams and singular into singular by 2.2, 5.4 and 5.8. Clearly, the 
critical class of the p-quotient which is vacuous for regular diagrams by 5.4 
is permuted according to the cycle 


(0,.p-—1,p —2,...2,1), 


under successive complementation in the series. 
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5.9 Example. To illustrate these ideas we give in Table II the sets I, 
for 3-singular diagrams with core [2*, 1*] for which T, = 5, 4, 2, 1 and g = 2, 
of weight w = 3, and the associated sets of first column hook lengths. The arrange- 
ment of these should be monotonic to construct [A], but they are left as they 
come after the appropriate multiples of p are added to the terms of I). 














| Table II 
3 Js r, First — a] 
-, -,{3] | 8,7,5,4,2,1,0 8,7, 5,4, 2,1,9 (3°, 2%, 1°] 
-, -,[2,1]| 11, 10,8,7,5,4,3, | 11,10,8,7,5,4,9, | [2% 15] 
2,1,0 2,1,3 
| -, -, [1°] | 14, 13, 11, 10, 8, 7, 6, | 14, 13, 11, 10, 8, 7, 9, | [2%, 1] 
5, 4, 3 2, 1,0 5, 4, 6, 2, 1,3 
(1), -, [2] | 8,7,5,4,2,1,0 8, 10, 5, 4, 2, 1,0 [4, 3, 2%, 12] 
(1], -, [12] | 11, 10,8,7,5,4,3, | 11,13,8,7,5,4,6, | (4,3, 1°) 
2,1,0 2,1,3 
[2], -, [1] | 8,7,5,4,2,1,0 8, 13, 5, 4, 2, 1,3 (7, 3, 15] 
{17}, -, [1] | 8,7,5,4,2, 1,0 8, 10, 5, 7, 2, 1,3 [4, 3%, 2, 14] 
-, [1], [2] | 8, 7,5, 4,2, 1,0 11, 7, 5, 4, 2, 1,6 [5, 24, 12] 
-, [1], [1°] | 11, 10,8, 7,5,4,3, | 14,10,8,7,5,4,6 | [5,2, 1°) 
2,1,0 2, 1,3 
-, [2], (1) | 8, 7,5, 4,2,1,0 14,7, 5, 4,2, 1,3 (8, 2, 1°] 
-, {17}, [1] | 8, 7,5, 4, 2, 1,0 11, 7, 8, 4, 2, 1,3 (5, 3, 14] 
[1], [1], [1] | 8, 7,5, 4,2, 1,0 11, 10,5,4,2,1,3 | [5%, 15] 
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PSEUDO-REGULARITY 
NATHAN DIVINSKY 


Introduction. An element x is said to be right-quasi-regular (r.q.r.) if there 
exists an element y such that x + y + xy = 0. This concept had its inception 
in the fact that (for rings with unity) if 1 + x has an inverse, written as 
1+y, then (1+ <x)(1+ y) = 1,%+y+ xy =0. Thus in rings without 
unity elements it seemed (1; 3; 12) profitable to consider this latter equation. 
Jacobson (9) was able to employ this concept in obtaining a structure theory 
for rings without chain conditions. 

Our point of departure is in considering the expression x + y + xy not as 
stemming from (1 + x)(1 + y), but as a special case of the more general 
expression x + x"y + x"*!y. Our considerations seem to bear most fruit in the 
case nm = 1 for commutative rings. We call an element x right-pseudo-regular 
(r.p.r.) if there exists an element y such that x + xy + x*y = 0. 

In §1 we show the existence of a maximal r.p.r. ideal R, called the subradical; 
and show that with some mild restrictions on the ring, it is simply the Jacobson 
radical J, thus obtaining a new representation of J. In general however, 
R < J. We also obtain some radical-like properties of R, as well as a definite 
relationship between R and J. 

In §2 we use the techniques of Brown and McCoy and in the commutative 
case are able to show that A — R is isomorphic to a subdirect sum of sub- 
directly irreducible rings, some of which are simple with unity (fields) and 
others are bound to their maximal nil ideal in the sense of Hall (8). 


1. An element x of a ring A shall be called right-pseudo-regular (r.p.r.) of 
degree n, if there exists an element y of A such that 
x + xy + x" ly = 0. 


It is clear that for » = 0 we get the familiar right-quasi-regularity. We shall 
be primarily interested in the case m = 1, and refer to it simply as r.p.r. It 
is also clear that if x is r.p.r. of degree m, then x is r.p.r. of degree n — 1 


etx. xy +x". xy = 0; 


and so in particular, if x is r.p.r. it is r.q.r. (right-quasi-regular). The converse 
of this last statement is not true, since in the ring of even integers modulo 4, 
the element 2 is r.q.r., since 


2+ 2+ 2.2 = 0(mod 4), 
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but it is not r.p.r. The exact relationship between right-pseudo-regularity and 
right-quasi-regularity is obtained in 


LEMMA 1. An element x of a ring A is r.p.r. of degree n, x + x"y + x"*y = 0, 
if and only if x is r.q.r. and there exists an element x’ such that x"x’ = x. 

Proof. If x is r.p.r. of degree nm, then clearly x is r.q.r. and x"x’ = x with 
x’ = —y — xy. Conversely if x is r.q.r., x + 2 + xz = 0, and if there exists 
an x’ such that x*x’ = x, then setting y = —x’ — x’z we find that 

x xty + xttly = x + x*(—x! — x'z) + x (— 2’ — x’2) 
=x—x— «13 — x* — x's 
= —x(z+x-+ xz) = 0. 


COROLLARY 1. An element x of a ring A is r.p.r. if and only if x is r.q.r. and 
there exists an element x’ in A such that xx’ = x. 


CoroLiLary 2. If x is in xA for every x of A, then right-quasi-regularity and 
right-pseudo-regularity are equivalent concepts. 


A more unexpected result is 


LEMMA 2. Right-pseudo-regularity of degree 2 and right-pseudo-regularity of 
degree n for all nm > 1, are equivalent concepts. 


Proof. Clearly, if x is r.p.r. of degree n, n > 1, it is r.p.r. of degree 2. Con- 
versely, if x is r.p.r. of degree 2, then by Lemma 1, <x is r.q.r. and there exists 
an x’ such that x*x’ = x. Notice that this is precisely strong regularity. Then 

Pk ae 2 , ty! gl 2 oe gf! —. 3 oe ,.. ete’ @ Z. 
Thus there is an element w = x’”"—' such that x"w = x. Therefore by Lemma 1, 
x is r.p.r. of degree n. 
A right ideal Q will be called r.p.r. of degree n if all of its elements are r.p.r. 


of degree n. To consider the existence of maximal r.p.r. of degree m right ideals 
we shall make use of the following 


LemMA 3. If x # 0, isr.p.r., x + xy + x*y = 0, then its right-pseudo-inverse 
(r.p.i.) y is not in the Jacobson radical J. 


Proof. lf y is in J, then y + xy is in J and there exists an element z such 
that 
ytuy tet (y+ xyz = 0. 
We have 
O=x+ xy + x*¥y + (x + xy + x*y/)z 

=x+x(y+ xy + 2+ yz + xyz) 

=x+x.0=x. 
This contradicts the fact that x ~ 0 and therefore y is not in J. 
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If we can show the existence of a maximal right ideal R,, which is r.p.r. of 
degree n, for every n, then by Lemma 2, R; = R; = ... = R,. This is in fact 
true but they are all equal to zero! 


THEOREM 1. Jf, for anyn > 1, Q is a right ideal all of whose elements are 
r.p.r. of degree n, then Q = 0. 


Proof. Let Q be r.p.r. of degree 2. If x is in Q then x + x*y + xy = 0. 
Then «x is r.p.r. with r.p.i. xy. But xy is in Q; and since Q is a right ideal all of 
whose elements are r.q.r., Q is in J, and xy is in J. This contradicts Lemma 3 
unless x = 0,0 = 0. 

Though Theorem 1 proves that there are no right ideals all of whose ele- 
ments are r.p.r. of degree n, nm > 1, there may be many elements which are 
r.p.r. of degree n. Let A be a division ring. Then every element # —1 is r.q.r. 
Furthermore x*x~' = x and thus by Lemma 1, every element # —1 is r.p.r. 
of degree 2 and thus by Lemma 2, every element # —1 is r.p.r. of degree n 
for every n. 

We shall now show the existence of a maximal r.p.r. ideal, which is not 
always zero. The first step is to show that the sum of two r.p.r. right ideals is 
again an r.p.r. right ideal. To this end we prove a slightly more general result 
akin to Kaplansky’s (10, Lemma 1). 


Lemma 4. If x is r.p.r. and if a belongs to an r.p.r. right ideal Q, then x + a 
is r.p.r. 


Proof. By Lemma 1, Corollary 1, it is sufficient to show that x + a is r.q.r. 
and that there exists an element v such that (x + a)v = x + a. The fact that 
x + is r.q.r. follows immediately from Kaplansky’s lemma, since x being 
r.p.r. is also r.q.r. and Q being an r.p.r. right ideal is an r.q.r. right ideal. 

To find the element v, we first define u = a — ax’ where xx’ = x. Since a 
is in Q, u is in Q, and there exists an element wu’ such that uu’ = u. Define 
v=x'+u' — x'u’. Then (x + a)v = x + a follows from xv = x and 


av = ax’ + (a — ax’)u’ = ax’ + uw’ = ax’ + u =a. 


We now define R to be the join of all the r.p.r. right ideals of the ring A. 
By Lemma 4, R is an r.p.r. right ideal. It is clear that R is the set of all ele- 
ments that generate r.p.r. right ideals, i.e., all x such that xi + xa is r.p.r. 
for every integer i and every element a of A. We now show that R is a two- 
sided ideal. 

Let x be any element in R and a be any element in A. We must show that 
ax is in R, i.e., that axi + axb is r.p.r. for every integer 1 and every b of A. 
Since x is in R, xi + xb is in R. Let y = xi + xb. Then it is sufficient to show 
that ay is r.p.r. for any y in R. Since R is a right ideal, ya is in R and therefore 
there exists a z such that ya + yaz + (ya)*z = 0. Then 


ay + (—ay — ayazy) + ay(—ay — ayazy) 
= ay — ay — a(yaz + ya + yayaz)y =~ 0—-—a.0.y = 0. 
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Therefore ay is r.q.r. Furthermore, since y is in R, y is r.p.r., there exists a y’ 


such that yy’ = y. Therefore ay . y’ = ay. Therefore, by Lemma 1, Corollary 1, 
ay is r.p.r. We have proved 


THEOREM 2. If A is an arbitrary ring, the join R of all the r.p.r. right ideals 
of A is an r.p.r. two-sided ideal. 


We shall call R the right subradical of A. Most of the time R < J, but 
using Corollary 2, Lemma 1 we have 


THEOREM 3. If x is in xA for every x of A, or of J, then J = R. 


By considering left-pseudo-regularity we could, by exactly the same tech- 
niques, prove the existence of a maximal I|.p.r. two-sided ideal L, which we 
call the left subradical of A. We would find that if A had a left unity, or if x 
was in Ax for every x of J, then J = L. It should be clear that though J 
enjoys certain left-right symmetric properties, the ideals R and L have no 
such well-roundedness. Of course if A has a unity element, then J = R = L; 
however in the general case, R and L are different. To see this, consider the set 
A of all two by two matrices of the form 


a 0 ) 
b O 
where a and 6 are integers mod 4. Then A contains 16 elements. The Jacobson 


radical J has 8 elements, namely those with a = 0 or 2, and 5 = 0, 1, 2 or 3. 
Furthermore, A has a right unity, 


1 0 ) 
0 Of; 
(in fact A has four different right unity elements) and therefore by Theorem 3, 


J = R. However A does not have a left unity and one can easily see that L 
has only the one element 
(3 9) 
0 OF; 


Though R is occasionally equal to J, it is often equal to 0. By Lemma 3, it 
is clear that if A is a radical ring, A = J, then R = 0. And also therefore, there 
is no nonzero ring which is equal to its right subradical. 

The right subradical R has the usual radical-like properties. 


THEOREM 4. The difference ring A — R is sub-semi-simple, that is, it has 
zero subradical. 

Proof. Let R be the subradical of A =A — R.IfZisin R, then there exists 
an element g in A such that + #9 + #9 = 0, that is, x + xy + x*y is in R. 
Then there exists an element z in A such that 


xt xy + x*4y + (x + xy + x*y)2 + (x + xy + x*y)*z = 0. 





hd 
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Rewriting this we have 


xt (x + x*)(y + 2+ ye + yxe + xyz + yxyt + yx*yz) = 0. 
Therefore x is r.p.r. Furthermore, since @ is in R, Zi + 24 is in R for every 
integer i and every Gin A. As for x, we can show that xi + xa is r.p.r. for 
every i and a, and therefore x is in R, # = 0, R = 0. 

Jacobson has shown (9) that J, = J(A,), where A, is the set of all » by n 
matrices with elements in A, J, is the set of all ” by m matrices with elements 
in J, and J(A,) is the Jacobson radical of A,. The corresponding result for 
subradicals is true and the proof is straightforward. 


THEOREM 5. The subradical R(A,) is equal to R,. 
LemMA 5. The subradical R = RA" for every integer n. 


Proof. Since R is an ideal, RA" < R. Conversely, if x is in R, there exists an 
element x’ such that x = xx’, x is in RA. Therefore R < RA, R < RA", 
R = RA". 


LemMA 6. The subradical R < M,, the intersection of all the maximal left 
ideals. 


Proof. Jacobson has shown (9) that 7. A < M,.SinceR < J.A,R < Mi. 


We shall now obtain a more definite relationship between J and R. 


LemMA 7. Let A be a non-nilpotent ring with the descending chain condition 
on right ideals, having all its idempotents in the centre. In particular A may be 
any commutative ring with d.c.c. Then if A" = A™*', A" has a unity element. 
In particular if A = A*, A has a unity element. 


Proof. By d.c.c. (2) there exists an idempotent e such that 
A=Ae+B 


where B is the set of all x — xe for x in A, and B is nilpotent. Furthermore 
Ae.B = B.Ae=0 since e is assumed to be in the centre. Therefore 
A? = (Ae)? + B?, 
A™ = (Ae)™ + B”. 

Since A > A? >...>A™>... is a descending chain of right ideals, there 
exists an integer m such that A* = A**'. It is clear that B* = B"*’, since if x is 
in B" then xe = 0. And since x = a + 3b, with a in (Ae)"*' and 3 in B**’, we 
have xe = 0 = ae + be = a + O. Therefore a = 0 and x = dD in B**'. There- 
fore B" < B"*' < B". Since B is a nilpotent ideal, B* = 0, A* = (Ae)*. But 
(Ae)" = A"e" = A*e. Then A* = A"e, e is a unity element for A". It is clear 
that e = e” is in A*. 


Lemma 8. If B is an ideal of A, then the radical of the ring B, J(B), is equal 
to J(\B, where J is the Jacobson radical of A. 
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This result is due to Perlis (13). 


Lemma 9. If B is an ideal of A, then R(B) < R(\B, where Ris the subradical 
of A. 


The proof uses Lemma 8 and is straightforward. Note that it is impossible 
to prove that R(B) = R/\B, since if we take B = J, R(B) = 0, whereas 
RO“B = RO\J = R. 


THEOREM 6. If A is a ring with d.c.c. on right ideals, having all its idem- 
potents in the centre, then R = JA"—', where n is the smallest integer such that 
A* = A*™*!. Whenn = 1,R = J. 


Proof. If A is nilpotent, A* = 0, R = JA*™—' = 0 by the remark just before 
Theorem 4. If A is not nilpotent, by d.c.c. there exists a least integer m such 
that A* = A*™*! ~ 0. Then by Lemma 7, A” has a unity element and by Theo- 
rem 3, R(A*) = J(A*). By Lemma9, R > R(A"). By Lemma 8, J(A*) = JOA". 
Therefore 

R > R(A*) = J(A*) = JNA" > JA™. 


Conversely, by Lemma 5, R = RA*—'. Thus R < JA*—'. Therefore R = JA*—". 
By similar techniques we can show that the left subradical L is contained 
in M,, the intersection of all the maximal right ideals, and that L = A*~' J. 


Discussion of Theorem 6. Theorem 6 is not true without d.c.c., as the 
following example, mentioned to the author in a discussion with Professor 
Zassenhaus, proves. Let x, be a basis for a commutative algebra, where the 
a's are real, 0 S a < 1. Define x. xs = xais if a + 8 < 1, and equal to 0 if 
a+ 68> 1. Then it is clear that every element is nilpotent. Thus A = J, 
and R = 0. However xq = Xj Xj, and therefore A = A®. To be sure, A is nil, 
but not nilpotent. 

Whether the theorem is true if A has d.c.c., but not the restriction that the 
idempotents lie in the centre, seems to be an open question. Since every ring 
with d.c.c. can be expressed (7) as A = M + M* where M is the maximal 
regular ideal, and M* is bound to its radical in the sense of Hall (8), and 
MM* = M*M = 0, the condition A = A? implies M* = M**. Thus the first 
step seems to be to decide whether there exists a ring, say B, with d.c.c., bound 
to its radical, without a right or left unity element and such that B = B?. 

Another attack on this question can be made using a technique due to 
Baer (4). Since we can write A = Ae + B, where B is the set of all x — xe 
with x in A, and with B in J, we embed Ae in a maximal left ideal F (we assume 
if necessary a.c.c.). Then A = (F, J). Since J is contained in every maximal 
modular’ left ideal, if F were modular, A = F, a contradiction. Thus either Ae 
is already A, in which case A has a right unity, R = J, and we are well away 
to proving Theorem 6; or F is not modular. The condition A = A?, together 


1A left ideal L is called modular if there exists an element e¢ in the ring A, such that xe — x 
is in L for every x of A. 





—/> 
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with d.c.c., imply that every maximal ideal is modular and that for every 
maximal left ideal, and in particular for F, A — F is an irreducible A-module. 
Then A — F is A-isomorphic to A — Q where Q is a maximal modular left 
ideal of A. It is not clear though that F must be modular. 

With regard to the following properties: 

(a) A has a right unity element; 

(b) For every x in A, x is in xA; 

(c) A = A?; 
it is interesting to observe that (a) implies (b), and (b) implies (c). Lemma 7 
proves that with d.c.c. and idempotents in the centre, (c) implies (a) and thus 
that the three conditions are equivalent. The above-mentioned example due 
to Zassenhaus shows that (c) does not imply (b) without d.c.c. To see that (b) 
does not imply (a) without d.c.c., consider the set of all infinite diagonal 
matrices, elements in a field, each matrix having only a finite number of 
nonzero entries. 


2. Following Brown and McCoy (5; 6), we associate with every element a 
in A, the ideal 


R'(a) = {ax — axe + DY ys, ae, — Dy a? 2}. 
We call an element a, R’-regular if a is in R’(a). We call an ideal J, R’-regular 
if every element of J is R’-regular. In this way we obtain the set R” of all 
elements that generate R’-regular ideals. The set R”’ is simply a special case 
of Brown and McCoy’s F-radical. If a is an element of the subradical R, then 
—a is also in R, 


—a—ab+a%=0, a =a(—d) — a*(—5), 


a is R’-regular. Thus it is clear that R < R”. In the commutative case R = R”, 
though in general they are different. From (5) we have the following important 
results about R”’. 


THEOREM 7. The set R” is an ideal of A. 
THEoREM 8. R”’(A — R”) = 0. 


THEOREM 9. The ring A — R” is isomorphic to a subdirect sum of sub- 
directly irreducible rings each having their R” = 0. 


THEOREM 10. A subdirectly irreducible ring A, has its R” = 0 if and only 
if there exists an element e ~ 0 in the minimal ideal K of A such that R'(e) = 0. 


THEOREM 11. A has its R” = 0 if and only if it is isomorphic to a subdirect 
sum of subdirectly irreducible rings each having their R” = 0. 


Let A be a subdirectly irreducible ring with R” = 0. Then by Theorem 10, 
there exists an element e ~ 0 in the minimal ideal K of A such that R’(e) = 0. 


Thus {ex — ex + Do ys, ez, — Dye? 24} = 0. 


Therefore ex = e*x for every x in A. Thus e? = ¢*, e® = e*, e? = e*. We will use 
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LemMA 10. [If there exists a nonzero idempotent e', both in the centre and in 
the minimal ideal K of a subdirectly irreducible ring A, then A is simple with eé’ 
as unity. 


Proof. Consider the Peirce decomposition of A for e’. A = A; + As, where 
A, = Ae’, and A: is the set of all x — xe’ for x in A and is the set of all x such 
that xe’ = 0. Since A: is an ideal which cannot contain e’, and since e’ is in 
every nonzero ideal, A; = 0. Therefore A = A, = Ae’. Since é’ is in K, 
A = K, A is simple (A? = Ae’. Ae’ contains e’ and is therefore not zero), 
and has e’ as unity element. 

Therefore if e? + 0 and ¢ is in the centre, in the subdirectly irreducible ring A 
with R” = 0, K contains a nonzero idempotent in the centre, and A is a simple 
ring with unity. Otherwise e? = 0, and then ex = 0 for every x in A. Thus 
eA = 0, JA = 0. We have proved 


THEOREM 12. If a ring A has all its idempotents in the centre, then it has 
R” = 0 if and only if A is isomorphic to a subdirect sum of subdirectly irredu- 
cible rings some of which are simple with unity, others (call them B,) have the 
property that K, B, = 0, where K, is the minimal ideal of B,. 


With some mild conditions on A we can remove the nonsimple components. 


TEEOREM 13. If a ring A has all its idempotents in the centre and either of 
the following properties: (a) if xA < I, then x is in I, for every ideal I; (b) every 
ideal M such that A — M is subdirectly irreducible, is modular; then A has R"” = 0 
if and only if A is isomorphic to a subdirect sum of simple rings with unity. 


Proof. Consider the components B, of Theorem 12, that are not simple, 
i.e., the ones that satisfy K, B, = 0. For every i, there exists an ideal M, in A 
such that A — M,=> B,. If A satisfies (a), then e, A < M, implies e,; is in 
M,. Thus if e, B; = 0 in B,, e; A < My, e; is in My, e, = 0 in B,. This con- 
tradicts the fact that e, ~ 0 in B,. Therefore there are no nonsimple com- 
ponents. Finally, if A satisfies (b), then A — M, has a unity element. There- 
fore e, B,; cannot be zero. 


We now turn our attention to the commutative case. Here R = R”. 

McCoy (11) made a study of all commutative subdirectly irreducible rings 
and considered them in two large classes: those having at least one element not 
a divisor of zero, and others all of whose elements are divisors of zero. We 
obtain more information about them in 


THEOREM 14. Jf A is a commutative subdirectly irreducible ring with sub- 
radical R zero, and with at least one element not a divisor of zero, then A is a field. 

If A is a commutative subdirectly irreducible ring all of whose elements are 
divisors of zero, then its subradical is zero and it is bound to its maximal nilideal 
N, and therefore also bound to its Jacobson radical J. Furthermore, if A has 
either d.c.c. or a.c.c., A is nilpotent. 
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Proof. By Theorem 12, a commutative subdirectly irreducible ring A with 
subradical R zero is either a simple ring with unity or eA = 0, e * 0. Therefore 
if A is fortunate enough to possess an element which is not a divisor of zero, 
eA #0, A is simple with unity, A is a field. 

From (11) we learn that if A is a commutative subdirectly irreducible ring 
all of whose elements are divisors of zero, then the minimal ideal K = (e,2e,... 
pe = 0) and eA = 0. Therefore KA = 0 and by Theorem 12, R = 0. Further- 
more, if A # N, the maximal nilideal, then let x be any element not in N. 
Since in the commutative case WN is the set of all nilpotent elements, x is not 
nilpotent, the ideal xA # 0, and the ideal x*A + 0. Since ¢ is in every nonzero 
ideal, e = x*y. Then 

(xy)? = x*y.y = ey = 0, 
since eA = 0. Therefore xy is nilpotent, xy is in N. Thus x. xy = e # 0, 
xN # 0. Thus if xN = 0, x must be in N, i.e. A is bound to N. Then of course 
A is bound to J, since if xJ = 0,xN = 0, x isin N < J. 

Assume now that A has d.c.c. If A is not nilpotent, and therefore not nil, 
there exists an element x which is not nilpotent. Consider the chain 
xA >x*A >..., and get an integer m such that x"A = x"*'A. Thus there is 
an element y such that x**! = x"*'y. Since x**'A is a nonzero ideal, ¢ is in it, 
e = x"*t!z, Then 

ey = 0 = xyz = xg = ce € 0. 
This contradiction shows that x must be nilpotent, A is nil and therefore 
nilpotent. 

Assume now that A has a.c.c. If A is not nil, then again let x be any non- 
nilpotent element. Then all the ideals x‘A are nonzero and therefore each of 
them contains e. Therefore 


€e= xy, = x72 =... = XY, =... 
Define V, to be the set of all z that annihilate x‘. It is clear that the V, are 
ideals and that V; < V2<...< V,i<....Since 


ex =Q0 = xy x = x'ty, 


¥: is in V441, but is not in V,. Therefore this is a properly ascending chain 

which does not stop after a finite number of steps. (Since x is not nilpotent, 

no V, is equal to the whole ring.) This contradicts a.c.c. and therefore A is 

nil. By a result of Zassenhaus as yet unpublished, which states that in the 

presence of a.c.c. the maximal nil ideal is nilpotent, A is nilpotent. 
Combining Theorems 12 and 14 we have our main result: 


THEOREM 15. If A is a commutative ring whose subradical R is zero, then A is 
isomorphic to a subdirect sum of subdirectly irreducible rings A,,A2,..., 
B,, Bs,..., where the A, are fields and the B, are bound to their maximal nil- 
ideals. If in addition, for every ideal M such that A — M is subdirectly irreducible, 
A — M satisfies either d.c.c. or a.c.c., and in particular if A satisfies d.c.c. or 
a.c.c., then the B, are nilpotent. 
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TWO REMARKS ON THE COMMUTATIVITY OF RINGS 
I. N. HERSTEIN 


In (1) and (2) we proved that under certain conditions a given ring R must 
be commutative. The conditions used there were “global” in the sense that 
they were imposed at once on the relation of a given element to all the other 
elements of the ring R. 

In this note we replace these global conditions by “‘local’’ ones that relate 
only to two elements of R at a time. We show that the results of (1) and (2) 
carry over to this situation. 

In (1) we proved that if in a ring R with centre Z, x*™ € Z for some integer 
n(x) > 1 for all x € R, then either R is commutative or its commutator ideal 
is a nil ideal. 

The condition x*™ € Z, of course, means that x*y = yx"® for all y € R. 
We prove here 


THEOREM 1. Suppose that R is a ring such that given any two elements x,y © R 
then for some integer n(x, y) > 1 which depends on both x and y 


rE Dy = syria, 
Then either R is commutative or its commutator ideal is a nil ideal. 


Proof. Suppose that R is not commutative. Let c # 0 be a typical element 
in the commutator ideal of R. We want to show that c is nilpotent. Since c is 
in the commutator ideal of R, 


c= > (a,b, — ba.) + > ri(de, — ed;) 


D @ 
+ p> (figs _ gids) Sit De ta(heck — kihy) uy. 


Let T be the subring of R generated by all the elements a,, b;, dy, es, fi Zu hes 
ki, Fs, Sz, ty, Uy appearing in the expression for c. Clearly c is a commutator in T. 
Suppose r € T. Then 


for suitable »; commutes with a, by the condition imposed on R. Similarly 
. Nins 
=T 


x 
ve = SE 
commutes with az for some integer m2; of course rz also commutes with a; since 
7; does. Continuing in this way we arrive at an integer m > 1 so thatr™ 
commutes with all the a’s, b’s, etc. appearing in the expression for c and which 
Received January 10, 1955. 
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generate 7. Thus 7” commutes with all the elements of T since it commutes 
with the generators of 7; that is, for every r € T, r"” is in the centre of T. 
By (1) this means that either T is commutative or its commutator ideal is a 
nil ideal. Since c # 0 is in the commutator ideal of T, T is not commutative. 
So ¢ is nilpotent and the theorem is established. 

In (2) we proved: let R be a ring with centre Z such that for all x € R 
x"*) — x € Z for some integer n(x) > 1; then R is commutative. The condi- 
tion x" — x € Z of course means that 


(x2 — x)y = y(x*™ — x) 
for all y € R. We localize the condition in the following 


THEOREM 2. If in a ring R for every pair of elements x and y we can find an 
integer n(x, y) > 1 which depends on x and y so that x"*” — x commutes with y, 
then R is commutative. 


Proof. Let T be the subring of R generated by x and y. Suppose ¢ € T. 
Thus for some integer m > 1, t; = #" — t commutes with x. For some other 
integer m > 1, t2 = t"; — t; commutes with y. Since ¢; commutes with x, f, 
also commutes with x. Thus ¢2 commutes with both x and y, and so with every 
element in the subring they generate. Thus ?, is in the centre of T. However 


te = ty" — ty = (™ — 1)" — (" — 1) = — (PP) — 2) 


where p(t) is a polynomial with integer coefficients. That is, for every t€ T 
we can find a polynomial p(t) with integer coefficients so that f*p(t) — ¢ is in 
the centre of T. By the principal result of (3) T must be commutative. Since 
both x and y are in T, xy = yx, and so R is commutative. 

The main theorem of (3) can also be generalized in the same fashion as the 
other two theorems. We state it without proof, 


THEOREM 3. If for every x and y in R we can find a polynomial p, ,(t) with 
integer coefficients which depend on x and y such that x*p,,(x) — x commutes 
with y, then R is commutative. 
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REMARKS ON FINITE GROUPS DEFINED BY 
GENERATING RELATIONS 


ROBERT FRUCHT 


The author wishes to make the following correction to his paper Remarks 
on finite groups defined by generating relations, Can. J. Math., 7 (1955), 8-17. 

The duplication principle stated on p. 8 holds only for those groups § 
which admit an automorphism taking 


Si, SiS2, SiSoSz, . . . . , SrSoS3 . . . Se 
into their respective inverses 
be (S,S2)—, (.S;S2S3), ooece (S,S2S3 oe @ S,)7. 


This fact (which had escaped the author's attention, and for whose discovery 
he is very much indebted to Graham Higman and B. H. Neumann) follows 
immediately from the consideration of that inner automorphism of @ which 
takes any element 7 of @ into 7,-'TT, = 7,TT7;; indeed for any m from 2 
to k+1 we have: 7;~-'(7;T,,)T:1 = (T:T,,)~', and 77, in © corresponds to 
S, S253... Sn—1 in §. 

If k = 2, by using 7, instead of 7, the following simpler condition results 
for a group § with 2 generators S, and S,: the duplication principle holds 
if, and only if, §& admits an automorphism taking 5, into S,~', and S, into 
S.-'. Hence it does not hold e.g. for the following group § of order 21: 

5:1S,S, = S;’, 5 = 1 


[in permutations: S; = (1 234567), S, = (2 3 5)(4 7 6)]; here G is easily 
seen to be of order 6 only instead of 42. 

It might be remarked that the duplication principle always holds for Abelian 
groups; whether it holds for a non-Abelian group depends not only on the 
group, but also on the generators. It holds e.g. for Dz (the dihedral group of 
order 6) when generated by 2 elements (as usually), or by the 3 permutations 
S,; = (12), S. = (13), S; = (123); but if De is generated by S, = (1 2), 
S, = (123), S; = (23), the group G is of order 4 only, instead of 12. 
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THE WEDDERBURN THEOREM 
HarrRY GOHEEN 


Mr. William Scott of the University of Kansas has kindly drawn my atten- 
tion to an error in my note, The Wedderburn Theorem, Can. J. Math., 7 (1955), 
60-62. The argument in lines 8-6 from the bottom of page 61 assumes that 
the division ring generated by an element which leaves a subring invariant 
and a subfield with the same property, also leaves that subring invariant. 
This assumption I have not proved, and it invalidates the proof of the theorem. 
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OVALS IN A FINITE PROJECTIVE PLANE 
BENIAMINO SEGRE 


1. Let $ be a finite projective plane (8, §17), i.e. a projective space of dimen- 
sion 2 over a Galois field y. We suppose that y has characteristic p # 2, hence 
order g = p", where ? is an odd prime and h is a positive integer. It is well 
known that every straight line and every non-singular conic of $ then contains 
q + 1 points exactly. 

Using the term oval to designate any set of g + 1 distinct points of 8 no 
three of which are collinear, we shall prove the following theorem, already 
surmised by Jarnefelt and Kustaanheimo (3) (deemed “implausible” in Math. 
Rev., 14 (1953), p. 1008): 


THEOREM I. If p ¥ 2, every oval of $ is a comic (i.e., can be represented by 
an equation of the second degree). 


This result fills up a gap in the finite congruence axiomatics set up by 
Kustaanheimo (4), and has important implications if we accept the idea, 
advanced by Jarnefelt (2), of a possible connection between the physical 
world and the geometry of a finite linear space (cf. also 1, 5, 6, 7). 


2. Let © denote any given oval of $, and B be an arbitrary point of €. 
Then @has a tangent at B, uniquely defined as the line of $ which contains B 
and no other point of ©; moreover, no three tangents of © meet at a point 
(7, Theorem 3). We begin by proving 


THEOREM II. Every inscribed triangle of € and its circumscribed triangle are 
perspective. 


It is not restrictive to identify the given inscribed triangle with the triangle 
of reference for homogeneous coordinates (x, x2, X3): 


A;:(1, 0, 0), A;:(0, 1,0), A;:(0, 0, 1); 
then we may denote by 
@1: X2 = kyx3, G2: X3 = koxy, G3: X1 = Rexre 


the tangents of @ at A;, As, A; respectively, where k;, k2, ks are three 
non-zero elements of the field y. If B:(c:, c2, cs) is any of the g—2 points of 
distinct from A;, A», As, then ¢;¢:cs; * 0; moreover, the lines A, B, Az B, 
A; B have equations of the form 


Xe = AiX%3, X3 = A2X1, X1 = Az Xo, 
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where the coefficients 1, A2, As are distinct from k;, ke, ks respectively, as well 
as from zero. Since these coefficients are given precisely by 
Ai = cas", Az =Coei', As = crc2’, 
they satisfy the equation 
(1) Ai Az As = 1. 

Conversely, if \; denotes any of the g —2 elements of + distinct from zero and 
from k;, the line x. = \,; x3 meets @ at A, and at a further point, B say, dis- 
tinct from A, As, As; hence the coefficients 2, A; in the equations x; = Az x1, 
x; = A3 X2 of the lines A; B, A; B are functions of A;, connected by (1), which 
take once each of the non-zero values of y distinct from ke, ks respectively. 
On multiplying the g—2 equations (1) thus obtained, we see that 

I? = ky ke ks, 
where II denotes the product of the gq—1 non-zero elements of y; whence 
(2) kikzk; = —1, 
as it is well known (8, §59) that I = —1. 
From the equation (2), Theorem II follows at once. In fact the points 
a2°* a3: (Rs, 1, ke ks), a3°a@: (ks ky, ki, 1), a@1°a2: (1, ky ke, k:) 
are joined to A;, Az, A; respectively by the lines: 
x3 = kokgx2, *1 = Rekixs, x2 = ky ke x1; 


by virtue of (2), these lines concur at the point K:(1, ki: ke, —k:2), which is 
therefore a centre of perspective of the triangles A; Az A; and a; a2 a3. 


3. We can now prove Theorem I. For this purpose we use the notation of §2, 
assuming, as it is not restrictive, that K coincides with the unit point (1, 1, 1); 
this is tantamount to supposing 


ki=k,- =k; = —1. 

If B:(cs, C2, C3) is any of the g—2 points of @ distinct from A;, A2, Az, we 

denote by 
b: bi X31 a box%e+ bs Xs =0 
the tangent of @ at it. This line contains B, but none of the points A;, As, As, 
2 - Gy, Az - 23, 2; - de; hence, if we put 
Bi = by —b2 —ds, Bo = —bi +502 — 53, Bs = —b; —b2 + dz, 

we have 


(3) by 6; + bo e2 + 303 = 0 
and 


(4) Db; be bs Bi Bo Bs ¥ O. 
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By virtue of Theorem II, the triangles BA, A; and ba, a; are perspective; 
this—as is immediately seen—is expressed algebraically by the equation 


C3 —Ce2 Cit C3 = a 
by —b; bs 0 = 0, 
b—b, 0 b | 


i.e., On suppressing the non-zero factor 6:: 
bo(ci + C2) = b3(c1 + 3). 


Likewise, the consideration of the inscribed triangles BA; A;, BA, A: and 
their circumscribed triangles gives: 


bs(c2 + €3) = Di(c2 +1), Bi(Cs + €1) = D2(Cs + C2). 
The last three equations imply: 
by : be 2 bs = (Co + €3) : (C3 + 1) : (C1 + C2); 
hence from (3), using also (4) and the hypothesis p ¥ 2, we deduce the equality 
Cots testi tcc, = 0. 
This equality means that each of the g—2 points B lies on the conic 
Xo X3 + X3X1 + 2X1 X2 = O. 


Since this conic obviously contains in addition the three points A, Ae, As, 
and its points are precisely g + 1 in number, thus @ must coincide with it, 
which proves Theorem I. 


4. We remark, in conclusion, that Theorem I does not hold on a finite plane 
of characteristic p = 2, if g > 4. For, as it is well known, the g + 1 tangents 
of a non-singular conic then meet at a point; this point and gq of the g + 1 
points of the conic constitute an oval, which, however, is clearly not a conic. 
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